Site Reliability Engineer Tech Lead

• Posted 5 days ago • Updated 5 days ago
Full Time
On-site
USD $145,000.00 - 217,000.00 per year

Job Details

Skills

  • Finance
  • Operational Risk
  • Software Engineering
  • IT Operations
  • Reliability Engineering
  • High Availability
  • Performance Metrics
  • Performance Tuning
  • Software Management
  • Collaboration
  • Regulatory Compliance
  • Capacity Management
  • Forecasting
  • Documentation
  • Emerging Technologies
  • Incident Management
  • Ansible
  • Terraform
  • Jenkins
  • Elasticsearch
  • Software Performance Management
  • Root Cause Analysis
  • Scripting
  • Python
  • Bash
  • Java
  • IaaS
  • Microsoft Azure
  • Google Cloud Platform
  • Google Cloud
  • Docker
  • Kubernetes
  • Provisioning
  • Continuous Integration
  • Continuous Delivery
  • Configuration Management
  • Budget
  • Data Management
  • Workflow
  • MongoDB
  • Snow Flake Schema
  • SQL
  • CA Workload Automation AE
  • BMC Control-M
  • Orchestration
  • Computer Networking
  • Database
  • Recovery
  • Agile
  • Legacy Systems
  • Scalability
  • Productivity
  • Mentorship
  • Continuous Improvement
  • Operational Excellence
  • Cloud Computing
  • Amazon Web Services
  • DevOps
  • Computer Science
  • Information Technology
  • Accountability
  • FOCUS
  • Communication
  • SAFE
  • Policies and Procedures
  • Privacy
  • Training
  • LOS
  • FLSA

Summary

At Freddie Mac, our mission of Making Home Possible is what motivates us, and it's at the core of everything we do. Since our charter in 1970, we have made home possible for more than 90 million families across the country. Join an organization where your work contributes to a greater purpose.

Position Overview:

At Freddie Mac, you will do important work to build a better housing finance system, and you'll be part of a team helping to make rental housing more accessible and affordable across the nation.

The Technology & Operational Risk department within the Multifamily (MF) division is seeking a Site Reliability Engineer (SRE) who will blend software engineering with IT operations to ensure the reliability, availability, scalability, and performance of key systems, services, and environments.

Your Impact:
  • System Reliability: Design, implement, and maintain automated solutions to ensure high availability, resiliency, and scalability of applications and services.
  • Incident Management: Collaborate with stakeholders to respond to production incidents, develop protocols to minimize downtime, conduct postmortems, and implement preventive measures to avoid recurrence.
  • Monitoring & Observability: Set up monitoring systems to track performance metrics, meet system health and performance targets, and address potential issues before they impact users.
  • Performance Optimization: Analyze system performance, identify bottlenecks, and optimize for speed, scalability, and resource utilization.
  • Automation: Leverage automation tools to reduce manual interventions in application management tasks and ensure efficiency, repeatability, and minimal human error.
  • Collaboration: Work closely with stakeholders to support new features, deployments, and compliance initiatives.
  • Capacity Planning: Forecast resource needs and plan for future growth to ensure system stability and scalability.
  • Documentation: Create and maintain up-to-date documentation for systems, processes, and troubleshooting procedures.
  • Continuous Improvement: Exhibit the intellectual curiosity to continuously learn emerging technologies and practices to design and deliver best-of-breed solutions for MF Technology.

Qualifications:
  • Proven expertise in designing, developing, and maintaining automation frameworks for application operations, including infrastructure provisioning, deployment pipelines, monitoring, and incident response, using tools such as Ansible, Terraform, Jenkins, and related technologies.
  • Extensive experience with observability and monitoring platforms (Elasticsearch Observability, Elasticsearch APM, OpenTelemetry), with a focus on automating system health checks, alerting, and root cause analysis.
  • Strong proficiency in programming and scripting languages (e.g., Python, Go, Bash, Java), with a track record of automating repetitive operational tasks and building self-healing solutions.
  • Hands-on experience with cloud infrastructure (AWS, Azure, Google Cloud Platform) and container orchestration (Docker, Kubernetes, EKS), including automated provisioning, scaling, and recovery of resources.
  • Demonstrated ability to lead and implement transformative initiatives that reduce manual toil, streamline operational workflows, and drive continuous improvement in reliability and efficiency.
  • Experience with CI/CD tools and configuration management for fully automated build, test, and deployment pipelines.
  • Deep understanding of SRE principles such as SLIs, SLOs, error budgets, and applying automation to enforce and improve these metrics.
  • Experience with data management platforms and automation of data workflows (e.g., MongoDB, Snowflake, SQL, Dremio, Qlik Replicate).
  • Familiarity with enterprise job schedulers (Autosys, Control-M) and automation of batch processes and job orchestration.
  • Solid foundation in networking, databases, and distributed systems, with experience automating troubleshooting and recovery procedures.
  • Experience with agile and DevOps cultures, driving adoption of automation best practices across teams.
  • Track record of championing automation-first initiatives that modernize legacy application operations and deliver measurable improvements in reliability, scalability, and team productivity.
  • Ability to mentor and guide teams in adopting automation tools and practices, fostering a culture of continuous improvement and operational excellence.
  • Relevant certifications in cloud, automation, or SRE/DevOps (e.g., AWS DevOps Engineer, Google SRE) are a plus.
  • Bachelor's degree in computer science, information technology, or related field (or equivalent experience).

Keys to Success in this Role:
  • Demonstrate a sense of accountability and ownership to identify and drive areas of improvement.
  • Focus on achieving results, influencing and collaborating with stakeholders to independently deliver desired outcomes.
  • Cultivate and maintain trusted relationships with Multifamily and Enterprise teams.
  • Clear and persuasive communication skills, with the ability to convey complex information and a vision for excellence to stakeholders.
  • Ability to work independently, persistently, and collaboratively in a fast-paced environment.
  • Ability to work evenings and weekends as needed.

Current Freddie Mac employees please apply through the internal career site.

We consider all applicants for all positions without regard to gender, race, color, religion, national origin, age, marital status, veteran status, sexual orientation, gender identity/expression, physical and mental disability, pregnancy, ethnicity, genetic information or any other protected categories under applicable federal, state or local laws. We will ensure that individuals are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

A safe and secure environment is critical to Freddie Mac's business. This includes employee commitment to our acceptable use policy, applying a vigilance-first approach to work, supporting regulatory mandates, and using best practices to protect Freddie Mac from potential threats and risk. Employees exercise this responsibility by executing against policies and procedures and adhering to privacy & security obligations as required via training programs.

CA Applicants: Qualified applicants with arrest or conviction records will be considered for employment in accordance with the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act.

Notice to External Search Firms: Freddie Mac partners with BountyJobs for contingency search business through outside firms. Resumes received outside the BountyJobs system will be considered unsolicited and Freddie Mac will not be obligated to pay a placement fee. If interested in learning more, please visit and register with our referral code: MAC.

Time-type: Full time

FLSA Status: Exempt

Freddie Mac offers a comprehensive total rewards package to include competitive compensation and market-leading benefit programs. Information on these benefit programs is available on our Careers site.

This position has an annualized market-based salary range of $145,000 - $217,000 and is eligible to participate in the annual incentive program. The final salary offered will generally fall within this range and is dependent on various factors including but not limited to the responsibilities of the position, experience, skill set, internal pay equity and other relevant qualifications of the applicant.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 90922487
  • Position Id: 24414969