Site Reliability Engineer

Overview

On Site
USD55 - USD65
Contract - W2

Skills

Site Reliability Engineer

Job Details

job summary:


  • The Global Risk Analytics technology team is seeking an infrastructure support engineer with a wide breath of experience in GRID computing, data storage clusters, high performance computing (HPC) and application development.



location: Charlotte, North Carolina

job type: Contract

salary: $55.06 - 65.06 per hour

work hours: 8am to 5pm

education: Bachelors



responsibilities:


  • Develop and maintain monitoring and alerting solutions tailored for monitoring decoupled Storage(Hadoop) and Compute(Spark Grid) production and production-like environments.
  • Build internal tools that integrate with existing monitoring platforms (e.g. Influx, Grafana, Telegraf) to collate and derive insights from production metrics.
  • Proactively monitor and manage the Infrastructure - Cloudera Hadoop and Apache Spark .
  • Support application releases, implementations, upgrades (RHEL, Spark), and/or changes into the environment.


qualifications:


  • Build strong partnership with Development, Infrastructure and platform teams.
  • Configures and maintains the set of tools and services that provide Continuous Integration and Continuous Delivery (CI/CD) services and validates access control mechanisms for the Software and Infrastructure Engineering team throughout the software development lifecycle.
  • 3-5 years of experience with logging and monitoring platforms - Influx/Prometheus, Grafana etc.
  • System Admin experience with Linux, Influx/Prometheus and Grafana tools.
  • Demonstrated ability to build observability tooling or integrations that serve multiple internal teams.


skills:

  • Proficiency in scripting (e.g., Python, Shell) to create diagnostic and observability tools.
  • Experience with Git and version control workflows.
  • Knowledge of CI/CD tools (Jenkins, GitHub) and Ansible is must.
  • Exceptional debugging and root cause analysis skills across application, infrastructure, and network layers.
  • Ability to multi-task, as well as being flexible to assist team members as needed.
  • Experience with secrets management.
  • Experience in a site reliability role.




Equal Opportunity Employer: Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status.

At Randstad Digital, we welcome people of all abilities and want to ensure that our hiring and interview process meets the needs of all applicants. If you require a reasonable accommodation to make your application or interview experience a great one, please contact

Pay offered to a successful candidate will be based on several factors including the candidate's education, work experience, work location, specific job duties, certifications, etc. In addition, Randstad Digital offers a comprehensive benefits package, including: medical, prescription, dental, vision, AD&D, and life insurance offerings, short-term disability, and a 401K plan (all benefits are based on eligibility).

This posting is open for thirty (30) days.


Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.