Overview
Skills
Job Details
Amtex Systems Inc is an information technology and talent solutions company offering talent and BI consulting to the companies in US for over 25 years.
Our solutions are designed to fill resource gaps, by providing the right candidates who deliver value to the organization. Our propensity to nurture and build strong relationships with our clients helps us better understand their business demands and gives us the ability to provide services that are on time and rise above the rest.
Title: Application Support / Site Reliability Engineer (SRE)
Location: New York, NY or Alpharetta, GA (Hybrid: 3 days/week onsite)
Duration: 12+ Months Contract
Interview: Multiple rounds, including a final in-person interview in the office
Role OverviewNon-Financial Risk Technology at client is seeking an Application Support and SRE Specialist for their Application and Data Engineering (ADE) team. The ADE team is responsible for application engineering, tooling, automation, and elevated production support, focusing on performance, reliability, and scalability for critical applications within a secure environment.
The ideal candidate will leverage strong SRE principles and DevOps practices to modernize and stabilize applications, primarily focusing on Linux (RedHat), containerization (OpenShift/Kubernetes), and extensive application/middleware support. Prior experience working at investment banks, investment managers, asset managers, or financial market data companies is mandatory.
Key Responsibilities-
Provide application support, application server administration, and technical troubleshooting for infrastructure and user incidents.
-
Implement Site Reliability Engineering practices by developing automated solutions to minimize downtime and reduce "toil."
-
Design and implement robust web architectures focusing on performance, availability, scalability, and disaster recovery.
-
Configure and maintain application monitors using industry-standard tools and develop customized monitoring solutions.
-
Evaluate and confirm SRE Metrics against firm and department goals.
-
Identify opportunities for automation, resiliency, and observability, building up internal knowledge and documentation.
-
Collaborate with central teams (Infrastructure, Networking, Security, Database) to successfully roll out application platforms.
-
Produce and periodically review reusable infrastructure design patterns.
-
Support the onboarding of vendor technologies in adherence to Morgan Stanley security blueprints.
-
Automate daily support functions and hygiene initiatives to create efficiency and consistency.
-
Occasional weekend availability and on-call work on a rotation basis.
Category | Skills & Experience |
Experience | 7-12 years in a similar hands-on application/middleware specialist role. Prior experience in a global financial organization (Investment Banks, Asset Managers, etc.) is essential. |
Operating System | Strong infrastructure knowledge in Linux / Unix (RedHat expertise highly critical), Storage, Networking, and Databases. |
Containerization | Hands-on experience with containers and orchestration platforms: OpenShift / Kubernetes. |
Automation/Scripting | Proficiency in Python and Shell scripting. |
Web/Middleware | Hands-on experience with web servers (Apache / Nginx), application integration, configuration, and troubleshooting. |
SRE/DevOps | Strong knowledge of SRE Principles, including tools and approaches to apply them; experience in troubleshooting Application Issues and Managing Incidents. |
Monitoring | Exposure to tools like Prometheus, Grafana, and the Open Telemetry framework. |
Architecture | Clear concept of load balancers, web proxies, and storage platforms (NAS / SAN). Experience managing large web-based n-tier applications in secure environments. |
Security | Familiarity with basic security practices, including Single Sign-On (SSO) and standard encryption protocols. |
Soft Skills | Excellent verbal and written communication skills. |
-
Exposure to data pipeline technologies such as Kafka, Redis, and Airflow.
-
Exposure to Big Data platforms like Hadoop / Cloudera and ELK Stack.
-
Experience with capacity planning and performance tuning.
-
Identity management protocols like OIDC / OAuth, SAML, LDAP integration.
-
Cloud application and infrastructure knowledge or certification is a plus.