Position: SRE Engineer (Site Reliability Engineer)
Location: Austin, TX
Duration: Long Term
About R Systems:
R Systems is a leading digital product engineering company that designs and develops chip-to-cloud software products, platforms, and digital experiences that empower its clients to achieve higher revenues and operational efficiency. Our product mindset and engineering capabilities in Cloud, Data, AI, and CX enable us to serve key players in the high-tech industry, including ISVs, SaaS, and Internet companies, as well as product companies in telecom, media, finance, manufacturing, and health verticals.
We Are Great Place to Work Certified in 10 countries with a full-time workforce [India, USA, Canada, Poland, Romania, Moldova, Indonesia, Singapore, Malaysia & Thailand]!
We are recognized as one of the Best Tech Brands 2024 by the Times Group and India's Top 500 Value Creators 2023 by Dun & Bradstreet.
Company Link:
Job Requirement:
Mandatory Skills: Site Reliability Engineer
Role Overview
We are seeking a hands-on Site Reliability Engineer (SRE) with deep expertise in Observability assessments, gap analysis, and solutioning (automation and manual fixes). This role partners closely with development and operations teams to drive enterprise-wide observability and reliability initiatives across all technology domains.
Responsibilities
Design and architect reliable, scalable, high-performance systems with a focus on availability, performance, and operational excellence
Lead observability current-state assessments, target-state design, and gap analysis across application, infrastructure, database, security, middleware, and network domains
Deliver Observability as a Service, including monitoring, dashboards, and alerting as code
Collect and analyze telemetry using Dynatrace, SolarWinds, Cisco, F5, Prometheus, Grafana, Splunk, Kibana, ELK, and AIOps tools
Implement Golden Signals (latency, traffic, errors, saturation) using standardized SLI/SLO frameworks
Instrument applications using OpenTelemetry (OTEL) for Java and .NET
Configure APM, infrastructure monitoring, synthetic monitoring, RUM, log monitoring, and distributed tracing
Integrate Dynatrace with CI/CD pipelines, ITSM systems, alerting tools, and incident automation frameworks
Tune alerts, baselines, and AI-driven anomaly detection (Dynatrace Davis AI) to reduce noise
Deploy and optimize Dynatrace OneAgent, PurePath tracing, Smartscape topology, and OKit modules
Collaborate with application and infrastructure teams to troubleshoot performance issues and implement permanent fixes
Define SRE standards, monitoring best practices, and governance models
Support authentication observability for Ping, ForgeRock, and SiteMinder (sessions and cookies)
Build strong cross-functional relationships and stay current on SRE and observability best practices
Qualifications
7 10 years of hands-on SRE experience across cloud, development, automation, and observability platforms
Strong experience with Observability as Code, dashboard/monitoring/alerting as code
Deep expertise with Dynatrace APM and observability tooling (Prometheus, Grafana, Splunk, ELK)
Hands-on experience with AWS (Control Tower, account setup, RDS, SSO)
Strong experience with Docker, Kubernetes, Linux, GitLab CI/CD, and Terraform
Advanced experience with APM, distributed tracing, synthetic & real user monitoring, and log analytics
Extensive hands-on experience with OpenTelemetry instrumentation
Experience integrating observability platforms with AWS, Azure, and Google Cloud Platform
Proven ability to define enterprise monitoring standards and scalable observability frameworks
Why Join R Systems?
Frequent Internal Hackathons: Engage in dynamic competitions with exciting prizes to keep your skills sharp.
Cultural Celebrations: Strengthen our familial bonds through shared celebrations, fostering a sense of community.
Diverse Project Exposure: Work on a variety of projects across sectors like Healthcare, Banking, e-commerce, and Retail, collaborating with leading global brands.
Centre of Excellence (COE): Benefit from technical guidance and upskilling opportunities provided by our team of technology experts, helping you navigate your career path.
E-Learning Platform: Gain access to comprehensive e-learning platforms coupled with a robust mentorship program to enhance your skills.
Open Door Policy: Embrace a culture of mutual support, respect, and open dialogue, promoting a collaborative work environment.
If you are passionate and excited about working in a fast-paced, innovative environment, we would love to hear from you!
R Systems is an equal opportunity employer that does not discriminate against any employee or job applicant because of race, color, religion, national origin, sex, physical or mental disability, age, or any other characteristic protected by law. We strive to build a team that reflects the diverse communities we serve, and we actively encourage applications from individuals of all backgrounds and experiences. Our commitment to equal opportunity extends to all aspects of employment, including recruitment, hiring, training, promotion, and benefits.
#LI-RC1