Overview
Hybrid, 3 days a week onsite
$40,000 - $60,000
Full Time
Skills
Production Support
Software
Scripting
SQL
Git
Monitoring
Job Details
Junior Production Support Specialist - New York, NY 10022 - 3 days a week onsite - Full-time opportunity
In-person interview required
We are looking for a Junior Production Support Specialist (JPSS) to join our Customer Success team. The JPSS will strengthen L2 support by acting as a bridge between L1 (Customer Success) and L3 (core development). The role focuses on internal systems, logs, and platform health, which is where most technical escalations land. The JPSS will work in sync with the Technical Support Representatives (L1) and provide technical insights that feed back into client-facing knowledge bases, helping them resolve similar issues in the future. This role reports to the Head of Core Platform.
Responsibilities
- Investigate system-level issues like API errors, configuration failures, or performance bottlenecks flagged by L1.
- Monitor production health (Datadog, Elastic) to proactively identify and resolve issues before clients escalate them.
- Provide temporary workarounds and quick fixes while L3 (Development) works on long-term resolutions.
- Collaborate with Implementation teams to troubleshoot integration-specific or client-specific technical problems.
- Escalate to L3 with detailed incident reports, logs, and debugging evidence when the root cause points to a code-level defect.
Who you are
- A collaborative communicator who can align teams and stakeholders across the business.
- A self-starter who thrives in fast-paced, entrepreneurial environments.
- Data-driven and analytical, able to interpret results and optimize for performance.
Technical Skills:
- 1-2 years of production support experience
- Some experience working with Cloud technologies (AWS, multi-tenant SaaS) preferred
- Experience collaborating cross-functionally with DevOps, SRE, engineering, and product operations teams
- Monitoring & observability tools (Datadog, Elastic, Grafana)
- Scripting (Bash, Python) & SQL for troubleshooting
- CI/CD familiarity (Jenkins, Git)
- Strong incident triage under pressure
- May require on-call duties for after-hours production incidents
Success Metrics:
- Uptime & SLA compliance
- Mean time to recovery (MTTR)
- Smooth deployments & minimized incidents