Accelyst
Accelyst is an innovative AI Consultancy that leverages a unique catalog of industry-specific Agents and leading-edge AI platforms to deliver tangible, integrated, secure, and ROI-optimized solutions. We combine deep industry and technical expertise to enable rapid deployment of innovative AI-driven capabilities to augment and automate client workflows for employees, customers, prospects, and investors.
Why Accelyst?
Join Accelyst to be part of a dynamic team that leverages AI-driven technology to make a positive impact. Our leadership, with Big Four Consulting experience, fosters a nimble, client-focused environment, minimizing bureaucracy to enhance delivery and professional growth. You''''''''ll work on complex enterprise technology initiatives that challenge and inspire, meeting high client expectations while collaborating with industry-leading professionals. Additionally, benefit from our profit-sharing model, reflecting our commitment to respect, integrity, and employee success.
Job Summary
Accelyst is seeking a Senior Database Monitoring & Observability Engineer to support enterprise infrastructure initiatives for a large-scale government technology program. This role is responsible for designing, implementing, and maintaining enterprise monitoring solutions that provide comprehensive visibility across applications, databases, infrastructure, cloud environments, and microservices.
The ideal candidate will serve as a subject matter expert in application performance monitoring (APM), infrastructure observability, and proactive system health management using Dynatrace. This individual will collaborate with infrastructure, cloud, database, DevOps, and application teams to optimize performance, automate monitoring, troubleshoot complex production issues, and support Root Cause Analysis (RCA) across hybrid and multi-cloud environments.
Job Roles and Responsibilities
- Design, configure, implement, and maintain enterprise monitoring solutions using Dynatrace.
- Develop dashboards, alerts, reports, and health checks to provide end-to-end visibility across enterprise applications and infrastructure.
- Monitor application performance, availability, user experience, and transaction flows across distributed environments.
- Configure synthetic monitoring and Real User Monitoring (RUM) to proactively identify service degradation.
- Monitor Windows and Linux servers, VMware environments, cloud infrastructure (AWS, Azure, and Google Cloud Platform), middleware, databases, and network components.
- Collaborate with cloud, infrastructure, database, and application teams to identify capacity trends, performance bottlenecks, and optimization opportunities.
- Participate in production incident response and provide monitoring insights during troubleshooting and Root Cause Analysis (RCA).
- Identify monitoring gaps and implement improvements to increase operational visibility and reduce alert fatigue.
- Develop automation scripts using PowerShell, Bash, or similar scripting languages to improve monitoring efficiency.
- Support Kubernetes and containerized application monitoring initiatives.
- Integrate monitoring solutions with IT Service Management (ITSM) platforms to streamline incident management workflows.
- Establish monitoring standards, documentation, and operational best practices across enterprise environments.
- Continuously evaluate monitoring strategies and recommend enhancements to improve system reliability, performance, and availability.
- Provide technical leadership and guidance to junior engineers and cross-functional teams.
- Perform additional duties as assigned.
Job Requirement:
Education
- Bachelor''''''''s degree in Computer Science, Information Systems, Information Technology, Engineering, or a related technical discipline, or equivalent professional experience.
Required Experience
- 5+ years of experience in Systems Engineering, Infrastructure Support, Monitoring, or Observability Engineering.
- 5+ years of hands-on experience implementing and supporting Dynatrace or comparable Application Performance Monitoring (APM) platforms.
- Experience supporting enterprise production environments with high availability requirements.
- Experience working within hybrid, multi-cloud, and microservices architectures.
- Experience participating in production support, incident management, and Root Cause Analysis (RCA).
Technical Skills
- Strong experience with Dynatrace administration, dashboard creation, alert configuration, and performance monitoring.
- Knowledge of Windows and Linux server administration.
- Experience monitoring VMware virtual environments.
- Working knowledge of AWS, Microsoft Azure, and Google Cloud Platform (Google Cloud Platform).
- Familiarity with Kubernetes and container monitoring.
- Understanding of databases, middleware, and enterprise application architectures.
- Basic networking knowledge including DNS, TCP/IP, load balancing, and network performance monitoring.
- Experience with scripting and automation using PowerShell, Bash, or similar technologies.
- Familiarity with ITIL processes and IT Service Management (ITSM) platforms.
- Experience supporting regulated industries such as healthcare, financial services, or government environments is preferred.
Professional Skills
- Excellent analytical and troubleshooting abilities.
- Strong problem-solving and Root Cause Analysis skills.
- Exceptional written and verbal communication skills.
- Ability to collaborate effectively across cross-functional technical teams.
- Detail-oriented with a proactive approach toward monitoring, automation, and operational excellence.
- Ability to prioritize multiple initiatives within fast-paced enterprise environments.