Overview
On Site
USD 172,000.00 - 334,600.00 per year
Full Time
Skills
Software Engineering
Customer Relationship Management (CRM)
Blaze
Value Engineering
Reliability Engineering
Systems Engineering
Expect
Software Development
Management
Algorithms
Systems Design
Conflict Resolution
Problem Solving
Mentorship
Python
Java
Optimization
Amazon Web Services
Google Cloud
Google Cloud Platform
Amazon EC2
Virtual Private Cloud
Amazon S3
Kubernetes
Orchestration
Machine Learning (ML)
Grafana
Git
Workflow
Configuration Management
Terraform
Ansible
Puppet
Linux
Systems Architecture
Build Automation
Incident Management
ROOT
Continuous Integration
Continuous Delivery
SAFE
Collaboration
Artificial Intelligence
Cloud Computing
Agile
Scrum
DevSecOps
Microservices
Regulatory Compliance
Communication
Documentation
Knowledge Sharing
MEAN Stack
SAP BASIS
Law
Promotions
Training
Insurance
Purchasing
LOS
Salesforce.com
Recruiting
Job Details
To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.
Job Category
Software Engineering
About Salesforce
We're Salesforce, the Customer Company, inspiring the future of business with AI + Data + CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too - driving your performance and career growth, charting new paths, and improving the state of the world. If you believe in business as the greatest platform for change and in companies doing well and doing good - you've come to the right place.
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Salesforce services have the reliability, capacity, performance, and availability to meet our customers' needs, and that they improve at the rate our customers expect.
Our software development focuses on enabling service owners to operate their services safely at scale, whether through paved path integrations onto observability frameworks, optimizing existing systems, designing infrastructure or eliminating work through AI/ML. On the SRE team, you'll have the opportunity to manage the complex challenges of scale which are unique to Salesforce, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. Experience with AI/ML systems, autonomous agents, or observability for intelligent platforms is a strong plus.
SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.
Required Skills
- 5+ years of experience in Python, Go, or Java for automation, tooling, and integration.
- Hands-on experience designing, building, and operating large-scale distributed systems, and identifying their shortcomings and optimization opportunities.
- Demonstrated experience in developing and deploying production-grade software applications or services.
- Strong experience with AWS or Google Cloud Platform and services like EC2, VPC, IAM, S3, EKS.
- Expertise in Kubernetes and modern container orchestration.
- Deep understanding of SRE principles: SLIs/SLOs, availability, resiliency, and incident metrics (TTD, TTR).
- Experience with AI/ML platforms, agents, or intelligent observability systems.
- Familiarity with observability tooling: Grafana, OpenTelemetry, Zipkin/Jaeger, and TSDBs.
- Hands-on experience with CI/CD pipelines and Git-based workflows.
- Experience with IaC and config management tools: Terraform, Helm, Ansible, or Puppet.
- Strong Linux systems knowledge and troubleshooting skills.
- Data-driven mindset for identifying systemic issues and improving service reliability.
Responsibilities
- Define and implement SLIs/SLOs with engineering teams, driving reliability into system architecture.
- Build automation and self-healing capabilities to reduce manual operations.
- Operate and scale monitoring, alerting, and tracing systems for proactive issue detection.
- Lead post-incident analysis, conduct postmortems, and ensure effective root cause resolution.
- Improve CI/CD practices to accelerate safe, frequent deployments.
- Use data to uncover trends, inform prioritization, and drive platform improvements.
- Collaborate on integrating AI-driven automation and observability to enhance reliability.
- Support and scale multi-cloud, multi-region services.
- Work within Agile teams, participating in Scrum ceremonies and iterative delivery.
Desired Skills
- Familiarity with DevSecOps practices and secure pipeline integration.
- Knowledge of microservices, service mesh, or zero-trust infrastructure.
- Experience operating in global, multi-tenant, or compliance-sensitive environments.
- Strong written and verbal communication, with emphasis on documentation and knowledge sharing.
Accommodations
If you require assistance applying for open positions due to a disability, please submit a request via this Accommodations Request Form.
Posting Statement
Salesforce is an equal opportunity employer and maintains a policy of non-discrimination with all employees and applicants for employment. What does that mean exactly? It means that at Salesforce, we believe in equality for all. And we believe we can lead the path to equality in part by creating a workplace that's inclusive, and free from discrimination. Know your rights: workplace discrimination is illegal. Any employee or potential employee will be assessed on the basis of merit, competence and qualifications - without regard to race, religion, color, national origin, sex, sexual orientation, gender expression or identity, transgender status, age, disability, veteran or marital status, political viewpoint, or other classifications protected by law. This policy applies to current and prospective employees, no matter where they are in their Salesforce employment journey. It also applies to recruiting, hiring, job assignment, compensation, promotion, benefits, training, assessment of job performance, discipline, termination, and everything in between. Recruiting, hiring, and promotion decisions at Salesforce are fair and based on merit. The same goes for compensation, benefits, promotions, transfers, reduction in workforce, recall, training, and education.
In the United States, compensation offered will be determined by factors such as location, job level, job-related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, and benefits. Salesforce offers a variety of benefits to help you live well, including time off programs, medical, dental, vision, mental health support, paid parental leave, life and disability insurance, 401(k), and an employee stock purchasing program. More details about company benefits can be found at the following link:
Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring, Salesforce will consider for employment qualified applicants with arrest and conviction records.
For California-based roles, the base salary hiring range for this position is $172,000 to $334,600.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.