job summary:
A rapidly growing, well-funded fintech innovator leveraging AI to redefine automated financial lifecycles is seeking a Staff Site Reliability Engineer to spearhead platform evolution in New York City. Operating as a senior individual contributor in a high-impact, onsite role, you will shape the future of a highly scalable cloud infrastructure and elevate operational excellence across core engineering teams. This permanent position offers a highly competitive compensation package alongside premium health benefits, including 100% employer-covered medical, dental, and vision insurance.
location: New York, New York
job type: Permanent
salary: $180,000 - 250,000 per year
work hours: 8am to 5pm
education: No Degree Required
responsibilities:
.Responsibilities In this role, your daily focus will center on scaling infrastructure and championing system availability. On a day-to-day basis, you will:
- Lead the architectural direction and evolution of AWS-based cloud infrastructure, driving strategic migrations to modern, scalable runtimes.
- Design, build, and optimize robust CI/CD systems with an absolute focus on automation, safety, and enhancing the overall developer experience.
- Establish and manage ephemeral environments and preview deployment systems to accelerate release cycles and bolster deployment confidence.
- Define, monitor, and evolve comprehensive observability standards, including metrics, logging, tracing, alert hygiene, and SLO/SLI development.
- Drive the incident response life cycle, conduct blameless postmortems, and actively nurture a proactive reliability culture across the organization.
- Partner closely with cross-functional engineering teams to design highly resilient, distributed systems while minimizing operational toil.
- Mentor engineers and advocate for system-level best practices regarding blast radius, risk management, and feedback loops.
qualifications:
Qualifications
Must-Haves
10+ years of dedicated experience in Site Reliability Engineering, cloud infrastructure, or production backend engineering.
Strong software engineering background with proficiency in one or more modern programming languages.
Proven expertise managing and operating large-scale distributed systems within high-traffic production environments.
Deep hands-on experience with architectural design, scaling, and administration within AWS.
Demonstrated mastery of modern observability tooling and advanced automated CI/CD frameworks.
Ability to work full-time onsite at our modern office location in Soho, New York City.
Nice-to-Haves
Experience leading large-scale platform migrations (such as moving containerized workloads from basic container services to advanced runtime orchestration frameworks).
Proven track record navigating ambiguous environments and setting clear technical direction within fast-moving, high-growth startup cultures.
Skills
Cloud Infrastructure: AWS administration, runtime migrations, scalable systems design
Automation & CI/CD: Automated deployment pipelines, ephemeral environment configuration, GitHub Actions
Observability & Reliability: SLO/SLI development, metrics tracking, logging, distributed tracing, alert management, incident response leadership
Software Engineering: Distributed systems architecture, modern programming languages
Core Competencies: Technical mentorship, cross-functional collaboration, systemic thinking, risk mitigation, blameless communication
Equal Opportunity Employer: Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status.
At Randstad Digital, we welcome people of all abilities and want to ensure that our hiring and interview process meets the needs of all applicants. If you require a reasonable accommodation to make your application or interview experience a great one, please contact
Pay offered to a successful candidate will be based on several factors including the candidate's education, work experience, work location, specific job duties, certifications, etc. In addition, Randstad Digital offers a comprehensive benefits package, including: medical, prescription, dental, vision, AD&D, and life insurance offerings, short-term disability, and a 401K plan (all benefits are based on eligibility).
This posting is open for thirty (30) days.
![]()