Site Reliability Engineer Architect Jobs in 30301

Refine Results
1 - 20 of 90 Jobs

SRE Architect - 100% remote - Full Time Only

Alpha Silicon

Remote

Full-time

Please share your Resume to or you can call me directly at +1- Roles & Responsibilities: 18+ years of Development and Operations experience in building and running applications in production that has uptime over 99%. Related experience and/or training; or equivalent combination of education and experience 8+ years of experience as a SRE Architect in running large Reliability & Observability Programs for large, complex infrastructure deployments / distributed systems for major Banking custom

Production Support Engineer (SRE & J2EE)

EPSoft Technologies LLC

Atlanta, Georgia, USA

Full-time, Third Party, Contract

Job Title: Production Support Engineer (SRE & J2EE) Job Summary: A Production Support Engineer with Site Reliability Engineering (SRE) capabilities in a J2EE environment, equipped with problem-solving skills, plays a pivotal role in ensuring the stability, performance, and continuous improvement of web applications and services. Key Responsibilities: Incident Management: Lead the response to production issues, ranging from identifying and troubleshooting problems to implementing immediate fixes

Site Reliability Engineer - SRE Engineer at Atlanta, GA

Parmesoft Inc.

Atlanta, Georgia, USA

Contract

Please send the resume to augustin (at) parmesoft (dot) com VERY URGENT REQUIREMENT Job title: Site Reliability Engineer - SRE Engineer Location: Atlanta, GA (Hybrid, 2 Days/week onsite in Atlanta, GA is a must) Duration: 8 Months Rate: $58-$60/Hr all-inc Client wants the consultant to be 100% onsite in Atlanta, GA from day 1 2 Days/week onsite in Atlanta, GA is a must Description: Technical knowledge : 1. Google Cloud Platform (Any Cloud Platform would be ok preferable Google Cloud Platform)

Need Site Reliability Engineer (Onsite)

Spar Information Systems

Atlanta, Georgia, USA

Full-time, Contract

Hope you are doing good. I have urgent requirement kindly let me know if you have any resource available. Position: Site Reliability Engineer Location: St. Louis, MO/ Atlanta, GA (Hybrid) | St. Louis, MO being first preference Duration: 8 months+ Job Description: Looking for a highly motivated Site Reliability Engineer, who is capable of build and run large-scale, massively distributed, fault-tolerant systems. Individual to work with teams across the organization and ensures core services reli

Senior Software Engineer (Site Reliability Engineering), Python, AWS (Face to Face interview is MUST)

Xoriant Corporation

Atlanta, Georgia, USA

Contract

Senior Software Engineer (Site Reliability Engineering), Python, AWS (Face to Face interview is MUST) ** Pay rate range is between $75 - $80 per hour on W2** ** onsite interview is required ** ** Yes, we are onsite by mandate 4-10 days per month. Our teams typically work Wednesdays each week. The day may change based on the needs of the teams the SRE supports. Top 5 Must Haves: Extensive/Strong AWS experienceexperience in designing, deploying managing scalable/reliable cloud-based infrastructur

Sr. Site Reliability Engineer

AgreeYa Solutions

Atlanta, Georgia, USA

Contract

Job Description: Lead and mentor a team of SREs, fostering a culture of collaboration, continuous learning, and operational excellence. Drive the adoption of SRE best practices and ensure adherence to reliability and performance standards.Design and implement highly available, scalable, and fault-tolerant systems using AWS.Collaborate with software engineering teams and other SREs to influence design and architecture decisions to improve system reliability and performance.Develop and maintain au

Urgent Req- Google Cloud Platform SRE Lead -in Atlanta,GA (onsite/Hybrid)- Full time role

Alpha Silicon

Atlanta, Georgia, USA

Full-time

Title: Google Cloud Platform SRE Lead Location: Atlanta GA Terms: Full Time Job Description As Lead SRE, identify and implement SLI's to achieve the best SLO in the environment. Implement the SRE concepts to fill the gaps and issues facing by development team. Handle Deployments with solid deployment and release strategies. Troubleshoot - issues identified and collaborate with the development team to provide permanent fix. Improve the alert and monitoring dashboard setup. Work on automations

Site Reliability Engineer - Local to GA

Advansys Inc

Alpharetta, Georgia, USA

Contract

Position: Site Reliability Engineer Location: Alpharetta, GA Duration: Long Term Need a strong SRE consultants local to GA

Site Reliability Engineer (SRE) - Java Production Support

Photon Interactive UK Limited

Alpharetta, Georgia, USA

Contract, Third Party

Site Reliability Engineer (SRE) Lead - Java Production Support Overview: We are seeking a skilled Site Reliability Engineer (SRE) with a strong background in production support to join our team. The ideal candidate will possess expertise in Java/J2EE development, Spring framework, PostgreSQL database management, and proficiency in tools such as Splunk, Dynatrace, Harness, Groovy, and Jira administration. This role requires a proactive problem-solving mindset, a keen eye for performance optimizat

SRE

CloudZenix, LLC

Remote

Contract

Contract W2 Exp: 10+ Years As a Site Reliability Engineer, your role is to provide reliability engineering services through observability and performance engineering techniques. Using monitoring and performance tools to deliver detailed feedback to product owners and development teams. You will partner with Product Owners to define service-level objectives and develop service-level indicators. Collaborate with cross-functional teams to design, build, automate, and maintain scalable infrastructur

Site Reliability Engineer (SRE) - Grafana Observability

LaSalle Network

Remote

Contract

LaSalle Network has partnered with a well-established software provider that's based in San Ramon, CA, who's in need of a well-rounded, Site Reliability Engineer (SRE) - Grafana Observability - with a strong background in Grafana and related tools such as Prometheus and Telegraf. The ideal candidate will play a crucial role in accelerating the transition of the observability stack into the Grafana ecosystem. This role is a contract opportunity with the potential to extend or convert to a full t

Urgent Remote Opportunity || SRE Candidate || Contract role

IT First Source

Remote

Contract

Urgent Remote Opportunity || SRE Candidate || Contract role Job description Hands-on functional experience in design, configure & customizing in SAP CRM 7.0: BP, Master Data, Technical Master Data, SAP CRM Fiori apps, HANA, Gateway OData services, Integration with Fiori.Service: Service Contracts, Service Requests, Service Orders, Install base/Equipment/Technical Master Data.Business Activity, Task, Service Tickets, ERMS.Senior SAP CRM Service Functional Analyst/Project manager.Experience in bus

AI SRE Support

Technostrides

Remote

Full-time

Role: AI SRE Support (Mid-Level) Mid-Level: 9+ Years Location: NC/Remote/US-Look for local candidates Duration: 6+ Months Job Description: Must Have: SRE, DevOps, AI, Security Infrastructure sideAct as production Gatekeeper for all changes (Product and infrastructure changes)Perform detailed deep dive (root cause analysis) on the repeated system issues and work with engineering team for permanent solutionProvide support as Tier2 application/platform support for Optum AI applicationsPeriodic on

Site Reliability Engineering (SRE) Lead (W2 ONLY)

ALTA IT Services

Remote

Full-time

Site Reliability Engineering (SRE) Lead100% Remoteship required per government contract Must be able to obtain a DHS Public Trust clearance As a Site Reliability Engineering (SRE) Lead, you'll deliver mission-critical services that empower end users. As the ideal candidate, you'll use your extensive experience designing and implementing end-to-end continuous delivery pipelines and experience in AI/ML. You will also use your experience working closely with developers and other engineers to identi

Director Of Platform Engineering - Remote - Upto $300K - EST

SoCode

Remote

Full-time

Director of Platform Engineering - Cutting edge AI - Upto $300K - Fully Remote - Eastern Time A leading AI company have developed a powerful engine to solve some of the most complex problems within science and technology. Their team of renowned scientists and engineers are using LLMs to help make better decisions and solve critical business problems. After generating $100M in funding, they are looking for a Director of Platform Engineering to lead and scale the team. What's on offer: Basic salar

SRE(Site Reliability Engineer) Lead #W2

Swarky Solutions

Remote

Full-time, Contract

(they will convert to permanent) Title: SRE (Site Reliability Engineer) Lead Location: Remote Duration: 4-12 month contract to hire Top Skills and Technologies: 7+ years experience with Oracle Database: (basic understanding) OpenShift: (basic understanding) MongoDB (basic understanding) Big Data Hadoop: (basic understanding) RedHat Linux: (basic understanding) API's (basic understanding) WebSphere (manage, install, maintain, and troubleshoot the WebSphere application server software): (basic

Site Reliability Engineer

Atmecs Global Inc.

Remote

Contract

Hi, Position Title: Site Reliability Engineer Location: Remote Duration: 12 Months Bachelor's degree in Computer Science or related field.9+ years of experience in software engineering or SRE roles, with a focus on large scale distributed systems.Strong coding skills in at least one programming language, such as Java, Python, or Go.Experience with distributed systems and service-oriented architectures.Experience with cloud computing platforms such as AWS or Google Cloud Platform.Strong convictio

Sr SRE (Site Reliability Engineer req)

InfoVision, Inc.

Atlanta, Georgia, USA

Contract, Third Party

Job Title: Sr Systems SRE Location: Atlanta, GA Technical knowledge : 1. Google Cloud Platform (Any Cloud Platform would be ok preferable Google Cloud Platform) 2. Strong Terraform Knowledge 3. Understanding of micro service architecture, Infrastructure , Network Responsibilities: Monitoring: Application and Infrastructure Monitoring Automating: Automating the deployment process and automating toil-reducing automation Improving: Improving the software development lifecycle by holding post-incid

Datadog Subject Matter Expert - W2 - Remote - Any Visa except H1B

Shiro Technologies

Remote

Contract

Key Skills : Datadog administration, Datadog APM, Cloud Integration, Deployment, YAML, DevOps SRE, SIEM, New Relic, Splunk, AppDynamics, Python, Powershell, and/or Bash scripting, linux. Datadog Certified Associate or Datadog Certified Professional are preferred Core Skills needed : Very Strong with Datadog Administration. Should have set up Datadog from the scratch. Very strong experience to integrate Datadog with Cloud applications or On-prem. Strong with Datadog APM. Analyze current environme

* Senior Site Reliability Engineer *

Kellton

Remote

Third Party, Contract

Kellton Tech is a full-service software development company, offering end-to-end IT solutions, strategic technology consulting and product development services in Web, SMAC (Social, Mobile, Analytics, Cloud), ERP-BPM, and IoT space Our methodology of inventing infinite possibilities with technology helps us develop best in-class and cost effective solutions for our clients. Currently Kellton Tech is looking for talented resources for one of our listed client. Below are the position details: Pos