Overview
Skills
Job Details
Role: Application Support and SRE Engineer
Location: New York, NY 10019 (Hybrid 3 Days Onsite)
Duration: Long-Term Contract
Interview: Video Onsite
Experience: 10 15+ Years
We are seeking a highly skilled Application Support & SRE Engineer . This group provides engineering, automation, containerization, observability, and elevated production support services across multiple business units.
The ideal candidate will have strong experience with Linux, Kubernetes/OpenShift, automation (Shell/Python), CI/CD, web servers, and SRE practices.
This is a hands-on engineering role, NOT a 24/7 production support position.
Key Responsibilities-
Provide application support, troubleshooting, and technical analysis for infrastructure and application issues
-
Contribute to SRE practices including automation, toil reduction, resiliency, and observability
-
Deploy and onboard applications onto Kubernetes/OpenShift container platforms
-
Automate repetitive processes using Shell, Python, and Ansible
-
Work with cross-functional teams: Infrastructure, Security, Network, Storage, Database
-
Implement and update monitoring/alerting using Prometheus, Grafana, OpenTelemetry
-
Ensure applications meet performance, scalability, and disaster recovery requirements
-
Create reusable design patterns and maintain documentation
-
Participate in incident management and post-incident reviews
-
Occasional weekend/on-call rotation as needed
-
10 15 years of hands-on experience as Application Support, Middleware, SRE, or DevOps Engineer
-
Strong Linux/Unix administration & troubleshooting (RHEL 7/8/9)
-
Hands-on experience with Kubernetes or OpenShift
-
Automation using Python, Shell, and Ansible
-
Experience with CI/CD pipelines (GitHub, Jenkins, GitLab CI)
-
Strong understanding of networking, storage (NAS/SAN), load balancers, proxies, SSL
-
Experience with Apache or Nginx for configuration and web server support
-
Experience troubleshooting distributed applications and incident management
-
Knowledge of SRE principles, SLIs/SLOs, error budgets
-
Familiarity with Prometheus, Grafana, OpenTelemetry and monitoring best practices
-
Experience in secure enterprise environments (SSO, OAuth/SAML, encryption standards)
-
Experience with Kafka, Redis, Airflow
-
Big Data platforms: Hadoop, Cloudera, ELK Stack
-
Cloud experience: AWS / Azure / Google Cloud Platform
-
Identity management: OIDC, OAuth, SAML, LDAP
-
Performance tuning, capacity planning
-
Experience in a large financial institution