Title: Data Integration Engineer – GEO (Generative Engine Optimization)
Location: Sunnyvale, CA – Onsite
Duration: Full-Time
Job Description
Summary:
We are seeking a highly skilled Data Integration Engineer with strong experience in building scalable data pipelines, API integrations, and Generative Engine Optimization (GEO) workflows. The ideal candidate should have expertise in Python, ETL processes, third-party data integrations, and exposure to LLMs and prompt engineering.
Key Responsibilities:
Design, build, and maintain scalable data pipelines for GEO workflows
Integrate data from third-party vendors and external data sources
Ensure data quality, schema validation, and consistency across systems
Build and maintain API extraction scripts for internal and external data sources
Develop reusable API connectors with authentication, rate limiting, and error handling
Collaborate with business teams to understand GEO use cases and technical requirements
Automate ETL processes using Python scripting
Apply knowledge of LLMs and prompt engineering to improve GEO outcomes
Evaluate AI/LLM-generated outputs against quality benchmarks and metrics
Document workflows, pipeline designs, and integration processes
Technical Skills:
Strong experience with Python scripting and automation
Expertise in ETL pipelines and data integration frameworks
Experience working with REST APIs, JSON, XML, and external data sources
Knowledge of cloud platforms and scalable data architectures
Understanding of LLMs, Generative AI, and Prompt Engineering
Familiarity with data validation, schema management, and monitoring tools
Required Skills:
Python Development
Data Integration & ETL Pipelines
API Development & Integration
Generative AI / LLM Exposure
Prompt Engineering Knowledge
Strong Problem-Solving & Communication Skills
Qualifications:
Bachelor’s degree in Computer Science, Engineering, or related field
4+ years of experience in Data Engineering / Data Integration roles