SRE Engineer - Onsite interview / Only New York candidate needed - Zealogics

Overview

On Site

Depends on Experience

Accepts corp to corp applications

Contract - Independent

Contract - W2

Contract - 12 Month(s)

No Travel Required

Skills

ResponsibilitiesThe successful candidate will: The successful candidate will be involved in application support

application server administration

technical troubleshooting of infrastructure and user incidents Incorporate Site Reliability Engineering practices into the day-to-day role by developing automated solutions to long-standing problems to ensure minimal downtime and reduce toil Experience with web architecture implementation including performance

availability

scalability

and disaster recovery planning. Experience with monitoring and alerting tools

configuring application monitors using industry standard monitoring tools

as well as developing customized monitoring solutions Revisit SRE Metrics and confirm against the firm and department goals Identify areas for improvement including automation

toil reduction

resiliency and observability across the platforms and help build up the knowledge and documentation for the team Partner with other teams in such as enterprise infrastructure

networking

security

storage

and database and data center to roll out application platforms successfully as per the design. Produce reusable infrastructure designs patterns and periodically review / refresh the patterns. Support vendor / vendor technology onboarding following the best practices and security blueprint. Apply technical skills to automate daily support functions

improve system stability

support hygiene initiatives and deliver innovation that creates efficiency and consistency. Occasional weekend availability and on-call work on a rotation basis.Required Skills Strong infrastructure knowledge in Linux / Unix

Databases

Storage and Networking technologies. Hands-on experience with containers and container orchestration platforms OpenShift / Kubernetes Experience with scripting in Python and Shell Hands-on experience of web servers (Apache / Nginx)

application integration

configuration

and troubleshooting. Clear concept of load balancer

web proxies and storage platforms like NAS / SAN from an implementation perspective only. Familiar with basic security practices to ensure secure hosting solutions

including single sign-on (SSO) and standard encryption protocols. Prior experience managing large web-based n-tier applications in secure environments on cloud Strong knowledge SRE Principles with grasp over tools / approach to apply them Strong infrastructure knowledge in Storage

Networking and Databases Experience in troubleshooting Application Issues and Managing Incidents Exposure to tools like Prometheus

Grafana

and Open Telemetry framework Excellent verbal and written communication skills.Desired / Nice to have skills Exposure and experience with data pipeline technologies such as Kafka

Redis and Airflow Exposure to Big Data platforms like Hadoop / Cloudera and ELK Stack Capacity planning and performance tuning exercise Identity management protocols like OIDC / OAuth

SAML

LDAP integration Cloud Application and infrastructure knowledge is a plus. Experience in Cloud / Distributed computing technology or certification is a plusExperience 7 to 12 years in a similar role of hands-on application / middleware specialist. Prior experience of working in a global financial organization is an advantage

SRE Engineer

Site Reliability Engineer

Site Reliability

Apache HTTP Server

Apache Hadoop

Apache Kafka

Application Servers

Cloudera

Communication

Computer Networking

Database

Disaster Recovery

Distributed Computing

Documentation

Encryption

Extract

Transform

Load

Finance

Hosting

Identity Management

Innovation

Kubernetes

LDAP

Linux

Load Balancing

Management

Middleware

Network Security

Nginx

OAuth

OIDC

Onboarding

Orchestration

Performance Tuning

Proxies

Python

Redis

Reliability Engineering

SAN

SSO

SaaS

Scripting

Shell

System Integration

Unix

Web Architecture

Web Servers

Application Support

Big Data

Blueprint

Capacity Management

Cloud Computing

Job Details

Responsibilities
The successful candidate will:
The successful candidate will be involved in application support, application server administration, technical troubleshooting of infrastructure and user incidents
Incorporate Site Reliability Engineering practices into the day-to-day role by developing automated solutions to long-standing problems to ensure minimal downtime and reduce toil
Experience with web architecture implementation including performance, availability, scalability, and disaster recovery planning.
Experience with monitoring and alerting tools, configuring application monitors using industry standard monitoring tools, as well as developing customized monitoring solutions
Revisit SRE Metrics and confirm against the firm and department goals
Identify areas for improvement including automation, toil reduction, resiliency and observability across the platforms and help build up the knowledge and documentation for the team
Partner with other teams in such as enterprise infrastructure, networking, security, storage, and database and data center to roll out application platforms successfully as per the design.
Produce reusable infrastructure designs patterns and periodically review / refresh the patterns.
Support vendor / vendor technology onboarding following the best practices and security blueprint.
Apply technical skills to automate daily support functions, improve system stability, support hygiene initiatives and deliver innovation that creates efficiency and consistency.
Occasional weekend availability and on-call work on a rotation basis.

Required Skills
Strong infrastructure knowledge in Linux / Unix, Databases, Storage and Networking technologies.
Hands-on experience with containers and container orchestration platforms OpenShift / Kubernetes
Experience with scripting in Python and Shell
Hands-on experience of web servers (Apache / Nginx), application integration, configuration, and troubleshooting.
Clear concept of load balancer, web proxies and storage platforms like NAS / SAN from an implementation perspective only.
Familiar with basic security practices to ensure secure hosting solutions, including single sign-on (SSO) and standard encryption protocols.
Prior experience managing large web-based n-tier applications in secure environments on cloud
Strong knowledge SRE Principles with grasp over tools / approach to apply them
Strong infrastructure knowledge in Storage, Networking and Databases
Experience in troubleshooting Application Issues and Managing Incidents
Exposure to tools like Prometheus, Grafana, and Open Telemetry framework
Excellent verbal and written communication skills.

Desired / Nice to have skills
Exposure and experience with data pipeline technologies such as Kafka, Redis and Airflow
Exposure to Big Data platforms like Hadoop / Cloudera and ELK Stack
Capacity planning and performance tuning exercise
Identity management protocols like OIDC / OAuth, SAML, LDAP integration
Cloud Application and infrastructure knowledge is a plus.
Experience in Cloud / Distributed computing technology or certification is a plus

Experience
7 to 12 years in a similar role of hands-on application / middleware specialist.
Prior experience of working in a global financial organization is an advantage

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

SRE Engineer - Onsite interview / Only New York candidate needed

Job Details

About Zealogics

Share