Data Integration Engineer – Generative Engine Optimization


Carlin Shayn Inc
Dice Job Match Score™
📋 Comparing job requirements...
Job Details
Skills
- ELT
- Large Language Models (LLMs)
- Python
- SQL
- Machine Learning (ML)
- Data Integration
- Data Modeling
- Authentication
- GEO
- Generative Engine Optimization
- Retrieval-Augmented Generation
- API
- Microsoft Azure
- Google Cloud Platform
- Cloud Computing
- AWS
- Data Warehouse
- REST APIs
Summary
Location: Sunnyvale, CA (Onsite)
Employment Type: Contract / C2C
Client: Apple (Through Wipro)
Position Overview
We are seeking a highly skilled Data Integration Engineer with strong expertise in Data Engineering, API Integrations, Python Development, and Generative Engine Optimization (GEO) to join a strategic initiative supporting cutting-edge AI and search optimization programs.
The ideal candidate will have experience building scalable data pipelines, integrating diverse data sources, and supporting Large Language Model (LLM) workflows. This role requires close collaboration with business stakeholders, AI teams, and data engineering teams to develop robust data solutions that enhance Generative Engine Optimization outcomes.
This is an onsite opportunity in Sunnyvale, CA, and local candidates are strongly preferred.
Key Responsibilities
Data Engineering & Pipeline Development
- Design, develop, and maintain scalable, reliable, and high-performance data pipelines for GEO applications.
- Build ETL/ELT workflows to ingest, transform, validate, and serve data from multiple structured and unstructured sources.
- Optimize data processing workflows to improve performance, scalability, and reliability.
- Implement monitoring, logging, and alerting mechanisms for production data pipelines.
Data Integration & API Development
- Integrate data from third-party vendors, external platforms, and internal systems to support GEO initiatives.
- Develop reusable API connectors and extraction frameworks.
- Build and maintain API ingestion processes with authentication, rate limiting, retry mechanisms, and error handling.
- Ensure seamless integration between various enterprise data platforms and AI systems.
Data Quality & Governance
- Establish data validation rules and quality checks.
- Ensure consistency, completeness, and integrity of datasets utilized by GEO models.
- Implement schema validation and metadata management practices.
- Troubleshoot and resolve data quality and integration issues.
Generative Engine Optimization (GEO) Support
- Collaborate with AI and business teams to support Generative Engine Optimization initiatives.
- Prepare and curate datasets used in LLM-powered applications.
- Support evaluation and testing of AI-generated outputs against GEO benchmarks and quality standards.
- Assist in optimizing content, data structures, and retrieval mechanisms for improved LLM performance.
AI, LLM & Prompt Engineering
- Apply practical knowledge of Large Language Models (LLMs) and prompt engineering techniques.
- Support retrieval-augmented generation (RAG) workflows and AI data preparation processes.
- Analyze model outputs and recommend improvements to enhance GEO effectiveness.
Stakeholder Collaboration
- Partner with business stakeholders to gather requirements and translate them into scalable technical solutions.
- Work closely with cross-functional teams including Data Engineering, AI/ML, Product, and Business Operations.
- Provide technical guidance on integration strategies and data architecture.
Documentation & Knowledge Sharing
- Create and maintain detailed technical documentation for data pipelines, integrations, workflows, and architecture.
- Document best practices, operational procedures, and troubleshooting guides.
- Participate in knowledge-sharing sessions and technical reviews.
Required Qualifications
- Bachelor''s degree in Computer Science, Information Systems, Engineering, or related field.
- 5+ years of experience in Data Engineering, Data Integration, or similar roles.
- Strong hands-on experience with Python programming.
- Experience designing and developing scalable ETL/ELT pipelines.
- Strong experience integrating data from APIs, third-party vendors, and external platforms.
- Proficiency in SQL and data modeling concepts.
- Experience working with cloud-based data platforms and distributed data processing systems.
- Strong understanding of data quality, validation, governance, and monitoring practices.
- Experience with REST APIs, authentication methods, and API integration frameworks.
- Excellent troubleshooting, analytical, and problem-solving skills.
- Strong verbal and written communication skills.
Preferred Qualifications
- Experience supporting Generative Engine Optimization (GEO) initiatives.
- Hands-on experience with Large Language Models (LLMs).
- Knowledge of Prompt Engineering techniques.
- Familiarity with Retrieval-Augmented Generation (RAG) architectures.
- Experience evaluating AI-generated content and model outputs.
- Exposure to AI/ML workflows and data preparation processes.
- Experience working in large-scale enterprise environments.
- Previous experience supporting Apple, Wipro, or similar technology-driven organizations.
Technical Skills
Programming & Scripting
- Python
- SQL
Data Engineering
- ETL/ELT Development
- Data Pipelines
- Data Modeling
- Data Integration
API & Integration
- REST APIs
- API Development
- Third-Party Integrations
- Authentication & Authorization
AI & GEO
- Generative Engine Optimization (GEO)
- Large Language Models (LLMs)
- Prompt Engineering
- AI Content Evaluation
- Retrieval-Augmented Generation (RAG)
Cloud & Data Platforms
- AWS / Azure / Google Cloud Platform (Preferred)
- Data Warehousing Technologies
- Distributed Data Processing Frameworks
- Dice Id: 91171966
- Position Id: 8987125
- Posted 3 hours ago
Company Info
About Carlin Shayn Inc
With years of industry expertise, we understand the challenges businesses face in finding the perfect candidate. Our team works closely with employers to craft personalized recruitment solutions.
Similar Jobs
It looks like there aren't any Similar Jobs for this job yet.
Search all similar jobs