Senior Systems Engineer - Observability (SSE)

Overview

On Site
Full Time

Skills

Enterprise Software
Research
Systems Analysis/design
Computer Science
Information Technology
Onboarding
DQL
SPL
Dashboard
ServiceNow
Windows PowerShell
Regular Expression
Python
JavaScript
Ansible
Terraform
Artificial Intelligence
Communication
Project Planning
SAFE
Conflict Resolution
Problem Solving
Change Management
Testing
High Availability
Attention To Detail
Kubernetes
Amazon EC2
Amazon Web Services
Microsoft Azure
Cloud Computing
IaaS
PaaS
SaaS
Scripting
Optimization
IT Infrastructure
Dynatrace
Splunk
IT Management
Reporting
Leadership
Service Delivery
Policies and Procedures
SLA
Storage
Interfaces
Configuration Management
Application Development
Collaboration
Provisioning
Continuous Integration and Development
Training
Systems Engineering
Operational Risk
Production Support
Management
Status Reports
Presentations
Organized
SAP BASIS
Law
Health Care
Life Insurance
Insurance

Job Details

Job Description

JOB SUMMARY

The Sr. Systems Engineer - Observability (SSE) role will define and implement infrastructure and application logging, setup governance, optimization, monitoring and controls for observability platform. The role will work with engineering, application and enterprise/solution architects to develop, implement and support logging, monitoring, reporting and automation for infrastructure and application services where applicable. This role serves as a subject matter expert in a complex array of full-stick solutions. This role serves as a subject matter expert performing research, analysis, design, creation, and implementation to meet current and future requirements across the enterprise.

CANDIDATE PROFILE

Education and Experience Required:

Undergraduate degree in engineering or computer science discipline and/or equivalent experience/certification

7+ years' experience in information technology with hands-on technical/engineering roles including:

5+ years' admin experience Dynatrace/Grail/Splunk Cloud/Cribl, etc.

3+ years' experience in AWS cloud platforms log ingestion solutions

3+ years data onboarding within a large-scale enterprise environment

Experience in implementing and maintaining Dynatrace/Grail or other enterprise observability solutions.

Experience in Dynatrace Query Language (DQL) and/or Splunk Processing Language (SPL) including building dashboards, reports and alerts to meet customer requirements.

Experience in integrating observability tools with other ITOps solutions (Harness, ReadyAPI, ServiceNow, BigPanda, etc.)

Additional Preferred Experiences:

Dynatrace Certified Admin and/or Splunk Certified Admin

Scripting experience in at least one of the following: PowerShell, Regex, Python, JavaScript, Ansible and Terraform.

Strong knowledge of emerging tools, software, applications, and AI solutions for attaining best-in-class IT technology across the enterprise.

Experience in building scalable pipelines for collecting, processing, and analyzing metrics, logs, and traces.

Experience in establishing and implementing Observability best practices to standardize, monitor and control usage/performance of solutions.

Excellent verbal and written communication skills for a wide range of audiences including executives, business stakeholders and IT teams

Project planning and management experience.

Experience operating in Scaled Agile Framework

Demonstrated experience delivering technology solutions in a fast-paced, deadline driven enterprise environment.

Demonstrated experience learning and applying new technologies to solve business needs

Excellent problem-solving skills working independently and through leading outcomes for cross functional teams.

Excellent understanding of change management, testing requirements, techniques, and tools to ensure high availability of systems

Strong attention to detail with an ability to operate effectively across multiple priorities

CORE WORK ACTIVITIES

Design, implement, and maintain high-performance and scalable observability solutions for Kubernetes - EKS/ACK, ROSA, DocumentDB, EC2 and other data sources in a complex enterprise environment.

Collaborate with cross-functional teams to gather requirements, architect solutions, and deploy logging and monitoring environments that align with business needs.

Leverage in-depth knowledge of AWS, Azure and Alibaba Cloud technologies, including IaaS, PaaS, and SaaS, to architect and manage logging and monitoring tools' deployments.

Enable streamlined operational processes and efficient management of the Dynatrace infrastructure using scripting and automation.

Responsible for infrastructure-as- code development and configuration management.

Lead optimization efforts for observability platform and explore alternative solutions using other automation technologies like Cribl, etc.

Onboard data sources from various IT infrastructure and app. components into observability tools (Dynatrace/Grail, Splunk,SignalFx, Cribl).

Provide technical leadership, oversight, governance and direction for services related to Marriott solution delivery.

Provide technical expertise to project team for successful project and change implementations

Determine customer requirements and work with sourced resources to develop solutions

Provide and present status, analysis and reporting to internal stakeholders, Executive Management and Senior Leadership.

Lead analysis of current environment for deficiencies and provides solutions

Identify opportunities to enhance the service delivery, operations and continual service improvement processes.

Delivering Technology

Creates and enhances administrative, operational and technical policies and procedures, adopting best practice guidelines, standards and procedures for employees, contractors and vendor engagements.

Management of daily infrastructure operations to ensure availability SLA is met for storage services

Interfaces with stakeholders to establish requirements and formulate priorities for infrastructure projects.

Leads/assists in configuration management

Works in a concerted effort with application development and engineering teams to resolve complex issues.

Provides oversight, collaboration, provisioning, management and maintenance of technology products and service alternatives that improve the production services environment

Responsible for the establishment and continuous development of monitoring and alerting for all production environments.

Develops internal processes and training to ensure team members have the needed skills and tools to support production environments and deliver project commitments.

Performs complex analyses for operational availability to promote a zero-defect environment.

Leads/assists operational teams in system updates & upgrades

Provides consultation for routine and complex systems development

Maintains a proper balance between business and operational risk

Facilitates achievement of expected deliverables and obligations of Services Providers

Ensures early warning to the business stakeholder executives regarding degraded or missed SLAs

Coordinates with Product and Architecture & Development teams for deployment and production support activities.

Managing Work, Projects, and Policies

Manages and implements work and projects as assigned.

Generates and provides accurate and timely results in the form of reports, presentations, etc.

Analyzes information and evaluates results to choose the best solution and solve problems.

Provides timely, accurate, and detailed status reports as requested.

Delivering on the Needs of Key Stakeholders

Understands and meets the needs of key stakeholders.

Develops specific goals and plans to prioritize, organize, and accomplish work.

Determines priorities, schedules, plans and necessary resources to ensure completion of any projects on schedule.

Collaborates with internal partners and stakeholders to support business/initiative strategies

Communicates concepts in a clear and persuasive manner that is easy to understand.

Generates and provides accurate and timely results in the form of reports, presentations, etc.

Demonstrates an understanding of business priorities

Additional Responsibilities

Manages time effectively and conducts activities in an organized manner.

Presents ideas, expectations and information in a concise, organized manner.

Performs other reasonable duties as assigned by manager.

At Marriott International, we are dedicated to being an equal opportunity employer, welcoming all and providing access to opportunity. We actively foster an environment where the unique backgrounds of our associates are valued and celebrated. Our greatest strength lies in the rich blend of culture, talent, and experiences of our associates. We are committed to non-discrimination on any protected basis, including disability, veteran status, or other basis protected by applicable law.

About Us

All positions offer a 401(k) plan, stock purchase plan, discounts at Marriott properties, commuter benefits, employee assistance plan, and childcare discounts. Benefits are subject to terms and conditions, which may include rules regarding eligibility, enrollment, waiting period, contribution, benefit limits, election changes, benefit exclusions, and others. Click here to learn more.

Full-time positions also offer coverage for medical, dental, vision, health care flexible spending account, dependent care flexible spending account, life insurance, disability insurance, accident insurance, adoption expense reimbursements, paid parental leave and educational assistance.

Washington Applicants Only: Employees will accrue paid sick leave, 0.077 PTO balance for every hour worked and be eligible to receive a minimum of 9 holidays annually.

Marriott HQ is committed to a hybrid work environment that enables associates to Be connected. Headquarters-based positions are considered hybrid, for candidates within a commuting distance to Bethesda, MD; candidates outside of commuting distance to Bethesda, MD will be considered for Remote positions.

About the Team

Marriott International is the world's largest hotel company, with more brands, more hotels and more opportunities for associates to grow and succeed. Be where you can do your best work, begin your purpose, belong to an amazing global team, and become the best version of you.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.