SRE/Cloud Platform Engineer

  • Plano, TX
  • Posted 5 days ago | Updated 5 days ago

Overview

On Site
Up to $70
Contract - W2
Contract - 12 Month(s)

Skills

Amazon Web Services
Apache Cassandra
Apache Kafka
Apache Tomcat
Backup Administration
Banking
Cloud Computing
Debugging
IBM WebSphere MQ
Incident Management
Communication
Continuous Improvement
Performance Tuning
Physical Layer
Mainframe
Management
Messaging
Microsoft Windows
Middleware
Data Link Layer
Database Administration
Issue Tracking
Java
Kubernetes
Software Development Methodology
Systems Engineering
Unix
Linux
Oracle
Python
ROOT
Scalability
Scripting
Shell
Terraform

Job Details

SRE/Cloud Platform Engineer

Location: Plano, TX (local candidates only; F2F interview required)
Duration: 12+ months (with possible extension or contract-to-hire)

W2 candidates only

F2F interviews on June 5 and June 6, 2025, in Plano, TX

Must have: Cloud, Terraform, IaC, Kubernetes, AWS, banking experience, ECS or EKS with monitoring, and incident management or hands-on experience with any ticketing system (handling tickets)

Key Responsibilities:
Deliver incident management and advanced-level L1/L2 support for internal applications across public cloud platforms, with a strong emphasis on AWS.
Serve as the initial point of contact for application developers via a ticketing system.
Communicate effectively with users at various organizational levels.
Implement and utilize automation to support the scalability of the environment.
Optimize operational processes to enhance efficiency, reliability, and security.
Train users to self-diagnose and troubleshoot issues for expedited resolution.
Conduct thorough investigations into issues to identify root causes and document strategies to prevent recurrence.
Provide support for public cloud environments, particularly AWS.
Manage events and incidents efficiently.
Develop and implement scalable automation processes to handle tasks in a large-scale environment.
Analyze and debug incidents, following up to gather feedback and prevent future issues.
Support different development environments, including Unix, Linux, Mainframe, and Windows.

Required Skills and Experience:
Proficiency in SDLC with the ability to read code (Java and Python).
Hands-on scripting experience (Unix shell, Python).
Extensive cloud experience, particularly with AWS.
Expertise in Kubernetes.
Strong troubleshooting and diagnostic skills for security and access issues in a large enterprise environment.
Database management skills (Oracle DBA, Cassandra DBA, CockroachDB) including performance tuning, connectivity, backups, indexes, and monitoring alarms.
Middleware and messaging experience (Kafka, MQ).
Experience with Tomcat.
System engineering and administration skills (Unix/Linux).
Familiarity with monitoring tools and ticketing systems.
Commitment to automating processes for continuous improvement.
Excellent communication skills.
Ability to analyze details, understand incident causation, and implement preventive measures to ensure


About CICD Global