Overview
Skills
Job Details
About R Systems:
R Systems is a leading digital product engineering company that designs and develops chip-to-cloud software products, platforms, and digital experiences that empower its clients to achieve higher revenues and operational efficiency. Our product mindset and engineering capabilities in Cloud, Data, AI, and CX enable us to serve key players in the high-tech industry, including ISVs, SaaS, and Internet companies, as well as product companies in telecom, media, finance, manufacturing, and health verticals. We Are Great Place to Work Certified in 10 countries with a full-time workforce [India, USA, Canada, Poland, Romania, Moldova, Indonesia, Singapore, Malaysia & Thailand]! We are recognized as one of the Best Tech Brands 2024 by the Times Group and India's Top 500 Value Creators 2023 by Dun & Bradstreet.
Company Link:
Job Description:
Cloud Infrastructure Management
- Design, deploy, and manage scalable infrastructure across Azure and AWS.
- Implement Infrastructure as Code (IaC) using Terraform, Bicep, and CloudFormation.
- Automate provisioning of compute, networking, and storage resources in multi-cloud environments.
Automation & CI/CD
- Build and maintain CI/CD pipelines using GitHub Actions, Azure DevOps, Jenkins, and GitLab CI.
- Automate testing, deployment, rollback, and blue/green or canary releases.
- Integrate IaC workflows into CI/CD for consistent environment provisioning.
Monitoring & Reliability (SRE)
- Implement observability using Prometheus, Grafana, ELK Stack, Azure Monitor, and Alertmanager.
- Define and track SLIs/SLOs for infrastructure, services, and data pipelines.
- Manage incident response, root cause analysis, and postmortems with automated alerting and dashboards.
Security & Compliance
- Automate security scans, policy enforcement, and compliance checks.
- Manage secrets and credentials using AWS Secrets Manager, Azure Key Vault, and HashiCorp Vault.
- Enforce RBAC, network segmentation, and encryption across environments.
Kafka DevOps Responsibilities
Kafka Cluster Management
- Deploy and manage Kafka clusters on Azure (HDInsight, Confluent Cloud) and AWS (MSK, EC2).
- Automate provisioning and scaling using Terraform and Bicep.
- Configure brokers, zookeepers, and replication strategies for high availability.
Kafka CI/CD & GitOps
- Build pipelines for topic creation, ACLs, schema registry updates, and connector deployments.
- Use GitOps workflows to manage Kafka configurations and connector lifecycle.
- Integrate Kafka Connect and Debezium for real-time data ingestion.
Kafka Monitoring & Reliability
- Integrate Kafka metrics into Prometheus, Grafana, and Azure Monitor.
- Define SLIs/SLOs for throughput, latency, consumer lag, and ISR health.
- Implement alerting for broker failures, under-replicated partitions, and lag thresholds.
Kafka Security & Governance
- Automate RBAC/ACLs using Confluent CLI, Strimzi, or Kafka scripts.
- Manage SSL/SASL authentication, certificates, and secret rotation.
- Enforce schema validation and audit logging for compliance.
OpenShift DevOps Responsibilities
Platform Operations
- Manage OpenShift clusters across hybrid and multi-cloud environments (AWS, Azure, on-prem).
- Automate cluster provisioning and upgrades using OpenShift Installer, Terraform, and Ansible.
- Configure Operators, Helm charts, and custom resources for platform extensibility.
CI/CD & Deployment
- Integrate OpenShift Pipelines (Tekton), Jenkins, and ArgoCD for GitOps-driven deployments.
- Automate image builds using Source-to-Image (S2I), BuildConfigs, and container registries.
- Implement progressive delivery strategies (e.g., canary, blue/green) within OpenShift.
Security & Governance
- Enforce RBAC, NetworkPolicies, PodSecurityPolicies, and compliance standards.
- Automate certificate management and secret rotation via Vault and OpenShift Secrets.
- Use OpenShift Compliance Operator for policy enforcement and audit logging.
Monitoring & SRE
- Integrate Prometheus, Grafana, and Alertmanager for cluster and workload observability.
- Track SLIs/SLOs for pod availability, deployment latency, and resource saturation.
- Automate scaling and recovery using Horizontal Pod Autoscalers and readiness/liveness probes.
Kafka on OpenShift
- Deploy Kafka using Strimzi or Confluent Operator within OpenShift.
- Automate topic creation, ACLs, and connector deployment via OpenShift Pipelines.
- Monitor Kafka workloads using OpenShift-native dashboards and Prometheus exporters.
- Frequent Internal Hackathons: Engage in dynamic competitions with exciting prizes to keep your skills sharp.
- Cultural Celebrations: Strengthen our familial bonds through shared celebrations, fostering a sense of community.
- Diverse Project Exposure: Work on a variety of projects across sectors like Healthcare, Banking, e-commerce, and Retail, collaborating with leading global brands.
- Centre of Excellence (COE): Benefit from technical guidance and upskilling opportunities provided by our team of technology experts, helping you navigate your career path.
- E-Learning Platform: Gain access to comprehensive e-learning platforms coupled with a robust mentorship program to enhance your skills.
- Open Door Policy: Embrace a culture of mutual support, respect, and open dialogue, promoting a collaborative work environment.