AI Ops Engineer

Fremont, CA, US • Posted 12 hours ago • Updated 12 hours ago
Contract W2
Contract Corp To Corp
On-site
Depends on Experience

Job Details

Skills

  • Amazon Web Services
  • Analytical Skill
  • Artificial Intelligence
  • BMC Remedy
  • Change Management
  • Cloud Computing

Summary

We are looking for an AI Ops Engineer for our client in Fremont, CA.
Job Title: AI Ops Engineer
Job Location: Fremont, CA
Job Type: Contract
Pay Range: $55/hr - $60/hr
Job Overview:
  • The AI Ops Engineer is responsible for monitoring, analyzing, and maintaining IT systems using automation and AIOps practices.
  • This role focuses on proactive monitoring, incident management, and improving system reliability through automation and operational best practices.
  • The candidate will collaborate with cross-functional teams to ensure system performance, security, and continuous improvement.
Experience:
  • 5+ years of experience in IT operations or L1 support roles.
  • Exposure to AIOps environments or automated monitoring solutions is preferred.
Responsibilities:
  • Monitor alerts proactively and detect anomalies from logs.
  • Perform daily system health checks until full automation is implemented.
  • Follow status checks and operational procedures as defined in runbooks.
  • Create and update runbooks and SOPs to reflect current processes.
  • Update system health status periodically in documentation tools.
  • Acknowledge incidents promptly and route them to appropriate teams.
  • Provide timely updates for high-priority incidents and ensure resolution within defined SLAs.
  • Communicate effectively with users regarding incident status and requests.
  • Complete service tasks within SLA timelines.
  • Follow documented procedures and maintain compliance with operational standards.
  • Collaborate with data engineers, DevOps, and business teams to ensure system reliability and security.
  • Implement best practices in machine learning operations and production environments.
  • Ensure compliance with enterprise data security, governance, and regulatory requirements.
Skills:
  • IT monitoring tools such as Splunk, Nagios, Zabbix, and Prometheus.
  • Scripting using PowerShell, Python, or Shell.
  • Log monitoring and anomaly detection.
  • Confluence and SharePoint for documentation and reporting.
  • ITIL processes including incident, problem, and change management.
  • Ticketing systems such as ServiceNow, Jira, and Remedy.
  • Strong analytical, problem-solving, and debugging skills.
  • Excellent communication and documentation skills.
  • Ability to follow and maintain runbooks and SOPs.
Qualification And Education:
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
Should Have:
  • ITIL Foundation Certification.
  • Experience with anomaly detection, time-series forecasting, and log analysis.
  • Certifications in monitoring tools or cloud platforms such as AWS or Azure.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10516350
  • Position Id: CA_AIOE_0330