Apply Now

Senior Site Reliability Engineer - Cloud

• Posted 30+ days ago • Updated 5 hours ago

Full Time

Fitment

Dice Job Match Score™

⭐ Evaluating experience...

Job Details

Skills

Financial Reporting
Capital Market
Functional Requirements
Collaboration
SaaS
Reliability Engineering
Quality Assurance
Software Engineering
Accountability
Root Cause Analysis
IaaS
Amazon Web Services
Training
Artificial Intelligence
C#
.NET
Java
Microsoft Azure
Ansible
Jenkins
Continuous Integration and Development
Continuous Integration
Continuous Delivery
Scalability
New Relic
Dynatrace
AppDynamics
Writing
Scripting
Windows PowerShell
Python
Bash
System Administration
Customer Facing
DevOps
FOCUS
Terraform
Cosmos
SolarWinds
Database
Red Gate
SQL
Test Scripts
Microsoft Windows
Linux
Kubernetes
Cloud Computing
Computer Networking
Firewall
Load Balancing
Computer Science
Finance
Management
Value Engineering
Communication
Technology Assessment
Business Intelligence

Summary

Join a dynamic team at the pulse of global markets, where we deliver innovative software and service solutions for essential financial reporting and capital markets transactions. At DFIN, we are a values-driven organization that empowers you to build a fulfilling career while bringing your authentic self to work every day. Our "Win as One" mentality ensures that our team's success is directly linked to Client, Shareholder and Employee Satisfaction.

Recognized as one of AMERICA'S MOST LOVED WORKPLACES for five consecutive years and a Built In Best Places to Work for six years, we are committed to our employees' total well-being. Enjoy competitive compensation, a flexible workplace, comprehensive benefits, and opportunities for professional growth. Bring your passion and talents to DFIN - because being YOU thrives here.

Summary:

We are looking for technical team members at all levels who want to push themselves to deliver best in market SaaS solutions. We offer a challenging environment where you will have to grow, adapt and use your skills consistently. Our customers rely on us in the moments that matter. Engineering delivers on that promise.

The Senior Site Reliability Engineer is responsible for ensuring our SaaS products are fast, stable and optimized for our customers. SRE's at DFIN take on availability, performance, managing change, monitoring, response and are guardians of non-functional requirements.

You either have an SaaS infrastructure background with a programmatic, automated mindset or are someone that comes with a software engineering background with SaaS infrastructure experience. The SRE goal is to build automated systems that reduce or eliminate manual work to keep our products up and running and performing optimally. We are looking for someone who thrives on collaboration within the team and across other groups and can operate independently to deliver solutions.

Responsibilities:
Champion and implement a culture of SRE to maintain a high-quality platform infrastructure in DFIN SaaS products
Leverage AI tools to enhance system reliability, including intelligent observability, incident prediction and automated remediation across cloud infrastructure
Evaluate and implement emerging AI powered operations and observability solutions to proactively improve system performance, reliability and scalability
Champion and implement application and infrastructure monitoring and alerting to prevent client impacting issues by ensuring system availability, performance and scalability to maintain SLOs and SLAs
Optimize application performance at scale
Automate everything including system operational runbooks
Define and support continuous integration and deployment pipelines (CI/CD) aligned to branching and quality assurance strategies
Dive deep into technology and stay on the forefront of the latest tools, technologies, and strategies; help evaluate, prototype, and integrate them into work processes
Perform with broad independence and deliver on project milestones and tasks on schedule while communicating progress regularly
Build strong relationships with SRE team members and software engineering teams to hold each other accountable for quality expectations
Learn continuously and apply lessons learned
Evangelize best practices, eliminate bottlenecks, and improve process
Participate in on-call duties 365/24/7 and lead the triage and RCA of production incidents

Qualifications:
5+ years experience designing, building, securing, monitoring and maintaining cloud infrastructure in Azure or AWS
Experience applying AI capabilities within CloudOps operations
Relevant certifications or training in AI, Cloud AI services or AIOps platforms are a plus
5+ years experience writing software in any modern software language such as C# .NET, Java
5+ years experience creating automated deployments with tools such as Harness, Azure DevOps, Ansible or Jenkins to manage Infrastructure as Code and software build and deployment in a continuous integration (CI) / continuous delivery (CD) environment
5+ years experience implementing production performance, availability, and scalability monitoring and alerting using a tool such as New Relic, Dynatrace, DataDog or AppDynamics
5+ years experience writing scripts in PowerShell or Python/Bash to automate system operations as runbooks for Windows or Linux environments.
5+ years experience supporting public client facing revenue generating systems
Strong DevOps focus and experience building and deploying Infrastructure as Code with Terraform or similar technology
Experiencing monitoring and preventing issues with databases and database queries (SQL, Cosmos) using tools like Solarwinds Database Performance Analyzer, Idera SQL Diagnostic Manager, or Redgate SQL Monitor
Experience planning, coordinating, developing and executing all stages of post deployment verification test scripts
Experience securing Windows or Linux systems in 24x7 production environment
Experience with containerization and managing Kubernetes clusters (AKS or EKS)
Experience with common cloud networking, firewall and load balancing configuration
BS in Computer Science or equivalent work experience

It is the policy of Donnelley Financial Solutions to select, place, and manage all its employees without discrimination based on race, color, national origin, gender, age, religion, actual or perceived disability, veteran status, actual or perceived sexual orientation, genetic information or any other protected status.

If you are a qualified individual with a disability or a disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access jobs.dfinsolutions.com as a result of your disability. You can request a reasonable accommodation by sending an email to

At DFIN, protecting your identity is a top priority. Please be aware of scammers impersonating DFIN recruiters. DFIN recruiters will never request personal information via email or text. You will only receive a text from us if you've already been in contact. All automated messages will come from If you ever have doubts about the legitimacy of any communication from us, please do not hesitate to reach out for verification via (this email is for general TA questions and is not used for updates on your application status). #BI-Remote

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 91081931
Position Id: cf0f521c3d9bf3762a5b09f52a2783db
Posted 30+ days ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

No location provided

•

Today

Job Description Join Oracle's Health Data Intelligence (HDI) team as a Software Engineer 3, focused on Site Reliability Engineering for large-scale healthcare analytics platforms. In this role, you will design, build, and operate highly reliable, scalable infrastructure and data pipelines that power mission-critical analytics globally. You will also contribute to the next evolution of cloud operations by advancing automation, observability, and AI-assisted reliability practices. This includes

Full-time

USD 79,100.00 - 158,200.00 per year

Sr Site Reliability Engineer (Advanced Threat Protection)

California

•

Today

Our Mission At Palo Alto Networks , we're united by a shared mission-to protect our digital way of life. We thrive at the intersection of innovation and impact, solving real-world problems with cutting-edge technology and bold thinking. Here, everyone has a voice, and every idea counts. If you're ready to do the most meaningful work of your career alongside people who are just as passionate as you are, you're in the right place. Who We Are In order to be the cybersecurity partner of choice, w

Full-time

USD 120,300.00 - 194,525.00 per year

Site Reliability Engineer

Greenwich, Connecticut

•

Today

Join our dynamic team as a Site Reliability Engineer, where you'll play a crucial role in enhancing the performance and reliability of our hybrid infrastructure. This is an exciting opportunity for those eager to dive into hands-on infrastructure and reliability engineering in a complex, real-world environment. Responsibilities Monitor and respond to incidents in production systems using enterprise observability tools. Assist in maintaining both on-premises and cloud infrastructure across vari

Full-time

USD 110,000.00 - 150,000.00 per year

Cloud Site Reliability Engineer (SRE) - Data Management & Analytics Platform

Princeton, New Jersey

•

Today

Description & Requirements At Bloomberg, data is at the heart of everything we do. As part of the Data Management and Analytics Platform (DMAP) SRE team you will play a critical role in driving analytics throughout the organization to improve our products, better engage with our customers, create greater efficiencies, and unlock new business opportunities through data-driven insights. Our team is responsible for capturing and processing the who, what, when, where, and why of how clients use Blo

Full-time

USD 160,000.00 - 240,000.00 per year

Search all similar jobs

Senior Site Reliability Engineer - Cloud

Dice Job Match Score™

Job Details

Skills

Summary

Similar Jobs