Role: AIOps Engineer
Location: Fremont, CA (Onsite role)
Mandatory Skills: Splunk; PowerShell or Python scripting; alerts and log monitoring; Confluence and SharePoint
Summary: Data Ops Engineer responsible for proactively monitoring, supporting, and ensuring the reliability of data pipelines and related infrastructure in an Azure-based ecosystem.
Experience Requirements:
5+ years in IT Operations, Data Engineering, or related fields.
Experience in Azure Data Services, ETL/ELT processes, and ITIL-based operations.
2+ years in AIOps implementation, monitoring, and automation.
Skill Requirements:
Basic understanding of Azure Data Services (ADF, Synapse, Databricks).
Experience in monitoring alerts from data pipelines (Azure Data Factory, Synapse, ADLS, Microsoft Fabric, etc.).
Familiarity with ETL/ELT concepts, data validation, and pipeline orchestration.
Experience in identifying failures in ETL jobs, scheduled loads, and streaming data services.
Hands-on experience with IT monitoring tools (e.g., Splunk, Azure Monitor, Dynatrace).
Skilled in creating and updating runbooks and SOPs.
Familiarity with data refresh cycles and the differences between batch and streaming processing.
Familiarity with ITIL processes for incident, problem, and change management.
Strong attention to detail, ability to follow SOPs, and effective communication for incident updates.
Solid understanding of containerized services (Docker/Kubernetes) and DevOps pipelines (Azure DevOps, GitHub Actions), with a focus on data-layer integration.
Proficiency in Jira, Confluence, and SharePoint for status updates and documentation.
Understanding of scripting (PowerShell, Python, or Shell) for basic automation tasks (see the log-scanning sketch after this list).
Ability to interpret logs and detect anomalies proactively.
Analytical thinking for quick problem identification and escalation.
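To illustrate the level of scripting expected, the following is a minimal Python sketch of a basic log-scanning automation: it counts ERROR lines per component and flags anything above a threshold. The log path, line format, and threshold are hypothetical placeholders, not taken from any specific pipeline.

```python
#!/usr/bin/env python3
"""Minimal log-scanning sketch. The log path, line format, and
threshold below are hypothetical placeholders."""
import re
import sys
from collections import Counter

LOG_PATH = "pipeline.log"   # hypothetical log file
ERROR_THRESHOLD = 5         # flag components logging more errors than this

# Assumed line format: "2024-01-01T00:00:00 LEVEL component message"
LINE_RE = re.compile(r"^\S+\s+(?P<level>\w+)\s+(?P<component>\S+)\s+.*$")

def scan(path: str) -> Counter:
    """Count ERROR lines per component."""
    errors = Counter()
    with open(path, encoding="utf-8") as fh:
        for line in fh:
            m = LINE_RE.match(line)
            if m and m.group("level") == "ERROR":
                errors[m.group("component")] += 1
    return errors

if __name__ == "__main__":
    counts = scan(LOG_PATH)
    noisy = {c: n for c, n in counts.items() if n > ERROR_THRESHOLD}
    if noisy:
        # In practice this would raise a ticket or notify the on-call team.
        print(f"ALERT: components above error threshold: {noisy}", file=sys.stderr)
        sys.exit(1)
    print("Health check passed: no component above error threshold.")
```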
Preferred:
Exposure to CI/CD for data workflows and to real-time streaming (Event Hubs, Kafka).
Understanding of data governance and compliance basics.
Experience with anomaly detection, time-series forecasting, and log analysis (a sketch of a simple detector follows this list).
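As an illustration of the preferred anomaly-detection skill, here is a minimal Python sketch of rolling z-score detection on a metric series (for example, pipeline run durations). The sample data, window size, and threshold are illustrative assumptions, not values from any real system.

```python
"""Rolling z-score anomaly detection sketch; all values are illustrative."""
from statistics import mean, stdev

def rolling_zscore_anomalies(series, window=10, threshold=3.0):
    """Flag points more than `threshold` standard deviations away from
    the mean of the preceding `window` observations."""
    anomalies = []
    for i in range(window, len(series)):
        baseline = series[i - window:i]
        mu, sigma = mean(baseline), stdev(baseline)
        if sigma > 0 and abs(series[i] - mu) / sigma > threshold:
            anomalies.append((i, series[i]))
    return anomalies

if __name__ == "__main__":
    # Hypothetical pipeline run durations in seconds; the 180s spike is flagged.
    durations = [60, 62, 59, 61, 63, 60, 58, 62, 61, 60, 59, 61, 180]
    print(rolling_zscore_anomalies(durations))  # -> [(12, 180)]
```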
Key Responsibilities:
Monitor and support data pipelines on Azure Data Factory, Databricks, and Synapse.
Perform incident management and root-cause analysis for L1 issues, escalating as needed.
Surface issues clearly and escalate them to the appropriate SME teams so they can be fixed at the root, rather than applying repeated short-term fixes.
Identify whether issues are at pipeline level, data source level, or infrastructure level and route accordingly.
Document incident resolution patterns for reuse.
Acknowledge incidents promptly and route them to the correct team.
Execute daily health checks, maintain logs, and update system status in collaboration tools (see the health-check sketch at the end of this section).
Work strictly according to the SOPs documented by the team.
Maintain and update SOPs, runbooks, and compliance documentation.
Update system health status in Confluence or SharePoint every 2 hours during the shift.
Update incident status every 4 hours for P1/P2 tickets.
Complete service tasks within SLA to keep queues clear.
Ensure compliance with enterprise data security, governance, and regulatory requirements.
Collaborate with data engineers, analysts, DevOps/SRE teams, and business teams to ensure reliability and security.
Implement best practices in ML operations and productionization.
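For the daily health checks and periodic status updates above, a scheduled script along these lines is typical. This is a hedged sketch only: the check functions are placeholders (a real version would query Azure Monitor or the relevant service APIs), and the local log file stands in for a Confluence or SharePoint update.

```python
"""Daily health-check sketch; all checks and outputs are placeholders."""
from datetime import datetime, timezone

def check_adf_pipelines() -> bool:
    # Placeholder: a real check would query recent Azure Data Factory runs.
    return True

def check_synapse_jobs() -> bool:
    # Placeholder: a real check would inspect Synapse scheduled job status.
    return True

CHECKS = {
    "Azure Data Factory pipelines": check_adf_pipelines,
    "Synapse scheduled jobs": check_synapse_jobs,
}

if __name__ == "__main__":
    timestamp = datetime.now(timezone.utc).isoformat(timespec="seconds")
    lines = [f"System health status as of {timestamp}:"]
    for name, check in CHECKS.items():
        lines.append(f"  {name}: {'OK' if check() else 'FAILED'}")
    report = "\n".join(lines)
    print(report)
    # A real runbook step would post this to Confluence or SharePoint;
    # appending to a local file is a stand-in for that update.
    with open("health_status.log", "a", encoding="utf-8") as fh:
        fh.write(report + "\n")
```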