Applicants must be currently authorized to work in the US on a full-time basis now and in the future. We are seeking a Data Engineer to build and maintain curated, reliable datasets in a Palantir Foundry environment. You will transform complex operational and enterprise data into actionable insights, validate datasets using Power BI, and ensure data quality, governance, and access controls. Candidates must be able to maintain a DoD security clearance.
Design, implement, and optimize PySpark, Python, and SQL pipelines to ingest, clean, transform, and normalize large-scale, heterogeneous datasets for analytics, reporting, and ML workflows. Collaborate with stakeholders to elicit requirements and translate operational or incomplete datasets into curated, schema-compliant, production-ready data products, ensuring data integrity, lineage, and consistency across pipelines.
Required Skills & Experience: - Proficiency in PySpark, Python, and SQL for large-scale data transformation.
- Knowledge of data modeling, semantic layers, graph-based structures, or ontology design.
- Understanding of data governance, access controls, and compliance practices.
- Ability to maintain a DoD security clearance.
Desired Skills & Experience: - Experience designing ETL pipelines for batch and streaming data.
- Ability to optimize pipelines for performance and scalability in distributed environments.
- Prior experience in defense, government, or high-security environments.
Location: Onsite 5 days a week in San Antonio, Tx
Compensation: Annual performance based bonus
You will receive the following benefits: - Dental Benefits
- Vision Benefits
- Paid Time Off (PTO)
- 401(k)