Job Title: Splunk Monitoring and Incident Management Engineer
Location: Boston, MA
Duration: 12+ Months
Experience Required: 6–8 Years
Role Overview
We are seeking a skilled Splunk Monitoring and Incident Management Engineer with strong experience in logging, monitoring, and operational management using the Splunk platform. The role focuses on developing dashboards and reports, correlating log data, and supporting incident management processes to ensure platform reliability and operational efficiency.
The ideal candidate will have hands on experience with Splunk based monitoring environments, operational runbooks, and incident triage while collaborating with technical teams to improve monitoring use cases and operational workflows.
Key Responsibilities
Implement and manage logging and monitoring solutions using the Splunk platform.
Develop dashboards and reports to monitor system performance and operational metrics.
Analyze and correlate log data to identify patterns, anomalies, and operational issues.
Support operationalization of the Splunk platform including development of standard operating procedures and incident response workflows.
Perform incident triage and support remediation activities for system alerts and operational incidents.
Collaborate with technical teams to improve monitoring strategies and platform performance.
Participate in the development and implementation of monitoring use cases and operational improvements.
Support project work and delivery related to monitoring, incident management, and platform operations.
Required Skills and Qualifications
Education
Bachelor’s degree in Computer Science, Information Technology, or a related field is preferred.
Experience
6–8 years of experience working with monitoring and logging platforms.
Hands on experience with Splunk for logging, monitoring, and incident analysis.
Experience supporting operational monitoring environments and incident management processes.
Technical Skills
Strong experience with the Splunk platform for logging, monitoring, and analytics.
Experience developing Splunk dashboards and operational reports.
Experience correlating log data to identify system events and performance issues.
Experience implementing operational procedures including triage workflows and incident remediation processes.
Knowledge of monitoring frameworks and incident management practices.
Soft Skills
Strong communication and collaboration skills.
Ability to work with cross functional teams and support operational environments.
Strong analytical and problem solving abilities.
Ability to manage multiple tasks in a project driven environment.
Preferred Qualifications
Experience working within monitoring use case development lifecycles.
Experience supporting project based monitoring implementations and platform enhancements.
Familiarity with incident management frameworks and operational support models.