![]()
Position: Application Support Engineer (SRE / DevOps Focus)
Location: Hybrid (Jersey City, NJ or Coppell, TX)
Employment Type: Contract-to-Hire
Role Overview
We are seeking a highly skilled Application Support Engineer with strong experience in Site Reliability Engineering (SRE), DevOps, and production support. This role focuses on maintaining and supporting critical applications in a production environment, ensuring high availability, performance, and reliability.
You will partner closely with engineering, infrastructure, and business teams to proactively identify issues, improve system resiliency, and drive automation across the application lifecycle.
Key Responsibilities
Application Support & Operations
- Provide L2/L3 production support for enterprise applications
- Troubleshoot and resolve technical incidents, outages, and performance issues
- Participate in incident management and root cause analysis (RCA)
Reliability Engineering (SRE)
- Define and implement SLIs, SLOs, and SLAs
- Design solutions for high availability, fault tolerance, and resiliency
- Improve system observability, monitoring, and alerting capabilities
Release & Deployment Support
- Collaborate with release teams to ensure smooth production deployments
- Validate operational readiness before major releases
- Support deployment validation and post-release monitoring
Monitoring & Observability
- Build and optimize monitoring and alerting frameworks
- Leverage tools (e.g., Grafana or similar) for real-time insights
- Use AI/ML-based analytics for anomaly detection and proactive issue identification
Automation & Efficiency
- Develop automation scripts and tools to:
- Reduce manual work
- Improve recovery times
- Enable self-healing systems
- Implement CI/CD pipelines and infrastructure-as-code practices
Performance & Scalability
- Conduct capacity planning and performance tuning
- Ensure systems scale effectively under high workload conditions
Risk & Operational Readiness
- Identify and mitigate operational and technical risks
- Support disaster recovery planning and testing
- Ensure systems meet security and compliance standards
Continuous Improvement
- Define and track KPIs and reliability metrics
- Drive ongoing improvements in system performance and stability
Collaboration & Culture
- Work across engineering, DevOps, and business teams
- Promote best practices in SRE, automation, and operational excellence
- Mentor team members and support knowledge sharing
Required Qualifications
- 8+ years of experience in application support, DevOps, or SRE roles
- Strong experience with:
- Production support and incident management
- Monitoring and observability tools
- CI/CD pipelines and automation frameworks
- Proficiency in one or more programming languages:
- Python, Java, Go, or similar
- Experience with:
- Cloud platforms (AWS, Azure, or Google Cloud Platform)
- Containerized or hybrid environments
- Strong understanding of:
- Performance tuning
- System reliability and scalability
- Excellent communication and collaboration skills
Dexian stands at the forefront of Talent + Technology solutions with a presence spanning more than 70 locations worldwide and a team exceeding 10,000 professionals. As one of the largest technology and professional staffing companies and one of the largest minority-owned staffing companies in the United States, Dexian combines over 30 years of industry expertise with cutting-edge technologies to deliver comprehensive global services and support.
Dexian connects the right talent and the right technology with the right organizations to deliver trajectory-changing results that help everyone achieve their ambitions and goals. To learn more, please visit .
Dexian is an Equal Opportunity Employer that recruits and hires qualified candidates without regard to race, religion, sex, sexual orientation, gender identity, age, national origin, ancestry, citizenship, disability, or veteran status.