Site Reliability Engineer/DevOps Engineer

Overview

Hybrid
$65 - $75
Full Time

Skills

Amazon Web Services
Terraform
Kubernetes
Docker
IAM

Job Details

Site Reliability Engineers focus on optimizing release deployments, maintaining secure cloud infrastructure, and handling day-to-day operational. They are also responsible for ingesting new solutions and products downstream from the Build/Automation organization and using a wealth of monitoring and logging tools to solve a broad spectrum of issues. Post- mortem and proactive identification of potential issues factor into iterative improvement.

Operations and Site Reliability s culture of diversity, intellectual curiosity, problem solving, and responsibility are the keys to its success. Our organization brings together people with a wide variety of backgrounds, experiences, and perspectives. We encourage them to collaborate, think differently and take remediation of issues across the finish line. We promote teamwork and group-based troubleshooting and offer an environment that provides support and mentorship to learn and grow.

Behind everything our customers see, the systems maintained by the Operations and Site Reliability team keep it secure and running. We make SAP NS2 s product portfolio available. We re always trying to ensure our customers have the best and consistent possible experience.

  • Setup, monitor and maintain DevOps cloud based SAAS products and solutions involving microservices, cloud, and containers.
  • Maintain security and data privacy, ensure compliance, and perform required security, compliance and performance tests
  • Work with architects on deployment architecture, security and CI/CD and then implement these on NS2 s cloud environments
  • Setup and maintain Kubernetes clusters on cloud environments.
  • System troubleshooting and problem solving across platform and application domains
  • Analyze and solve operational issues, and respond to incidents
  • Experience working with appropriate complex systems administration, database administration, and landscape maintenance
  • Experience maintaining the integrity and security of servers and systems
  • Experience developing and monitoring policies and standards for allocation related to the use of computing resources
  • Conduct root cause analysis and implement continuous improvements
  • Experience in developing and implementing testing strategies and documenting results
  • Work in a diverse environment and cross-train with other global team members
  • Evaluate new technology options and vendor products

Required Experience/Skills:

  • BS/BA degree in Computer Science, Management Information Systems, or related IT discipline preferred
  • ALLOWABLE SUBSTITUTION: An additional four (4) years of experience can be substituted for a BS or BA degree
  • 8+ years of experience
  • Expertise with source code management such as SVN, GitHub or GitLab
  • Experience with binary resource management tools such as JFrog Artifactory or Harbor
  • Strong background in Linux/Unix administration
  • Expertise with building, implementing, and/or supporting monitoring tools
  • Experience deploying high volume applications in Google Cloud Platform, AWS or Azure using automation.
  • Expert understanding of web services, networking, virtualization, and internet protocols
  • Ability to multitask and handle various projects, deadlines and changing priorities
  • Excellent communication and prioritization skills
  • Expertise with security fundamentals as they pertain to SaaS Multitenant Application systems
  • Strong interpersonal, presentation, and customer service skills
  • FedRAMP security fundamentals

Desired Experience/Skills:

  • Experience deploying, monitoring, and maintaining SaaS based products and solutions
  • System troubleshooting and problem solving across platform and application domains
  • Evaluate new technology options and vendor products
  • Ensuring critical system security through the use of best-in-class cloud security solutions.
  • 8+ years of experience with automation orchestration, and configuration management tools such as Chef/Ansible, Terraform and Jenkins/Concourse
  • 5+ years of experience with Docker and container orchestration using Kubernetes
  • 5+ years of experience with infrastructure as code environments, including any activities around automated server or network configurations, large-scale software deployments, and monitoring and testing
  • 5+ years of experience in a DevOps role in projects involving Java, NodeJS, J2EE, messaging, Microservices and Containers for complex, scalable, SAAS or enterprise grade products
  • 5+ years of experience with AWS Route 53, EC2, S3, CloudWatch, DynamoDB, RDS, IAM, ACM, KMS, VPC
  • Exposure to and understanding of troubleshooting IP networks and application stacks
  • Ability to work with distributed teams in a collaborative and productive manner
  • Strong problem solving, analytical, design, architecture, decision- making, and communication skills.
  • Self-driven and motivated with the desire to work in a fast-paced, results-driven agile environment with varied responsibilities
  • Experience with Cloud Foundry

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.