Google Cloud Platform Site Reliability Architect

google cloud platform, Google Cloud, site reliability, sre, reliability, Docker, Jenkins, Kubernetes, Ansible, Automation, Cloud, Configuration management, Continuous integration, Cross-functional, Agile, Scripting, Python, ecommerce, gke, cicd
Full Time
$180,000 - $200,000
Travel not required

Job Description

CMK Resources is seeking a remote, full-time Google Cloud Platform Site Reliability Architect for one of our premier nation-wide B2C clients with over 2000 locations to bring leading edge technologies both on-premise and in the cloud (Google Cloud Platform). 

Automation and superior software quality/performance and resiliency will be your mindset. You will be an expert resource in software and operational high-performance design patterns and support different development, architecture and operational teams from start to finish to create scalable and resilient solutions multi-location highly available solutions.
Responsibilities

  • Design, architect and build Google Cloud Platform solutions with SRE and operational teams for performance/capacity related issues associated with complex multi-tier distributed platforms during the SDLC and post-production.
  • Design and coordinate new Build/Run initiatives prior to production and assure product readiness including infrastructure recommendations, software/script development, load/chaos testing, optimization, SLO definition, capacity planning, and observation/alerting.
  • Hands-on leadership and guidance of Self-Service performance testing helping teams overcome test scenario design challenges
  • Review source code and identify bottlenecks. Identify opportunities to improve performance and scale.
  • Perform new POCs for newer technologies and architectural patterns to help teams make informed decisions.
  • Define new SLOs for services and applications. Perform workload analysis/help teams define testing scenarios to meet non-functional SLA requirements defined by the business.
  • Work to reduce/minimize ongoing runtime costs through efficient throttling/queuing/pooling/autoscaling across application and infrastructure tiers.
  • Proactively identify anomalies and opportunities in platforms in production to achieve greater performance/scale and recommend to impacted teams for future planning.
  • Define performance quality gates and support canary development CI/CD scenarios around performance for teams
  • Support Performance Analytics, Testing and Observation infrastructure as needed
  • Introduce and evaluate new technologies and tools for performance measurement/observation/profiling/debugging/simulation, etc.

Qualifications:

  • Experience designing and supporting/troubleshooting large scale multi-tier distributed on-premise and cloud applications (in and enterprise level Google Cloud Platform environment)
    • Experienced in Kubernetes, and Google AppEngine
    • Experience leveraging messaging frameworks such as Kafka or Pub/Sub solutions is nice to have
  • Experience architecting, developing and setting up new infrastructure solutions for hybrid in-cloud/on-premise applications
  • Experience in Capacity Planning or Performance Engineering and leveraging predictive analytics to determine needed scaling patterns for platforms
  • Experience in Web Development and/or Web Service creation in languages such as Java, Go, Python highly desired but not required
  • Demonstrable cross-functional knowledge with systems, storage, networking, security and databases.
  • Versed in automating infrastructure (Ansible preferred, though similar experience with Terraform, Cloudformation, etc. is acceptable)
  • Experience using observation tools such as Splunk or Dynatrace
  • Experience with LRU/MRU caching schemes and solutions leveraging MemCached or Redis for achieving scale/performance is a nice to have
  • Experienced in performance tracing/profiling using Google Developer Tools is a nice to have
  • Experience with Jetty, Tomcat/F5 Load Balancers, Apache Web Servers and Adobe Experience Manager is a nice to have



CMK Resources is an IT Staffing company that is focused on recruiting the most talented IT professionals in and around the data center space for world class organizations. CMK is built on a promise to provide a personal experience to each and every customer and consultant. We listen to customer challenges, upcoming projects, culture, and technologies to identify the right talent in the right amount of time. We take the time to understand what our consultants are great at and what environments they excel in. Our entire recruiting process takes place within the U.S. by Technical Recruiters familiar with the IT industry. We provide unrivaled personal service and industry knowledge that exceeds the competition.

Please note, depending on your specific placement, you may be required to prove that you have received the COVID-19 vaccine or have a valid religious or medical reason not to be vaccinated. CMK is an Equal Opportunity Employer and reasonable accommodations will be considered.

Dice Id : 90934249
Position Id : 1821
Originally Posted : 2 months ago
Have a Job? Post it

Similar Positions

Site Reliability Architect
  • Jobot
  • Houston, TX, USA
Site Reliability Architect
  • Jobot
  • Dallas, TX, USA