Location: Irving, TX
Salary: $53.00 USD Hourly - $57.00 USD Hourly
Description: Data Engineer, IAM Data Lake (Google Cloud Platform)We are not accepting C2C or 1099 arrangements.Locations: Irving, TX (Dallas) or Columbus, OH
Employment Type: Contingent / Contract
About the RoleWe are seeking a
Data Engineer to design and build scalable
IAM Data Lake solutions on
Google Cloud Platform (Google Cloud Platform). In this role, you will contribute to moderately complex information security engineering initiatives, focusing on data ingestion, transformation, and secure data delivery within a cloud-native architecture.
You will partner with cross-functional teams to develop reliable data pipelines, ensure data integrity, and support identity and access management (IAM) analytics use cases.
Responsibilities- Design, build, and maintain data lake architectures on Google Cloud Platform
- Develop and optimize batch and streaming data pipelines using Google Cloud Platform-native services
- Implement Pub/Sub-based streaming solutions, including schema design, evolution, and versioning
- Build and manage data ingestion frameworks, including incremental loads and Change Data Capture (CDC)
- Work with columnar data formats (Parquet, Avro, ORC) and optimize compression strategies
- Apply best practices for data modeling, transformation, and processing using tools such as PySpark
- Define and enforce data storage standards (bucket structure, naming conventions, lifecycle policies, access controls)
- Enable data consumption through views, APIs, and curated datasets
- Collaborate with security and engineering teams to meet information security and compliance requirements
- Support CI/CD processes for data pipeline deployment and maintenance
Minimum Qualifications- 4+ years of experience in Data Engineering, Information Security Engineering, or a related field
- Experience with Google Cloud Platform (Google Cloud Platform) services and architecture
- Strong background in data pipeline development and data processing
- Hands-on experience with PySpark
- Experience working with APIs and workflow orchestration tools (e.g., Airflow)
- Familiarity with CI/CD practices and tools
Preferred Qualifications- Experience building Data Lakes on Google Cloud Platform using big data technologies
- Knowledge of Hadoop ecosystem (HDFS)
- Experience with streaming architectures and real-time data processing
- Understanding of IAM data and security-focused data models
- Familiarity with event-driven architecture and schema management best practices
- Experience with data modeling and analytics-ready dataset design
Key Skills- Google Cloud Platform (4-6 years)
- Data Processing & Pipelines (2-6 years)
- PySpark (4-6 years)
- CI/CD (4-6 years)
- Airflow (2-4 years)
- APIs (2-4 years)
- Hadoop Ecosystem (2-4 years)
- Data Modeling (1-2 years)
- AVRO / columnar formats (preferred)
Additional Information- Preferred work location: Dallas, TX (Irving)
- Alternate location: Columbus, OH
- Please submit candidates to one requisition only (shared hiring manager across multiple roles)
By providing your phone number, you consent to: (1) receive automated text messages and calls from the Judge Group, Inc. and its affiliates (collectively "Judge") to such phone number regarding job opportunities, your job application, and for other related purposes. Message & data rates apply and message frequency may vary. Consistent with Judge's Privacy Policy, information obtained from your consent will not be shared with third parties for marketing/promotional purposes. Reply STOP to opt out of receiving telephone calls and text messages from Judge and HELP for help.
Contact: This job and many more are available through The Judge Group. Please apply with us today!