ClifyX group is an award-winning IT consultancy formed in 1998. Our mission is to provide our clients with optimal technology solutions that are effective and within budget. We specialize in helping organizations review their strategic SOW project and talent needs and implement high-value, cost-effective solutions that increase profitability and efficiency. Our consulting capabilities include expertise in cloud, artificial intelligence, data analytics, and the compliance aspects of cybersecurity development.
Work With Us: What makes working at ClifyX so great? We give tons of flexibility, listen to you, give you the opportunity to work with different technologies, give feedback on your profile, and present you to global Fortune companies.
Job Title: Data Engineer
Location: Santa Clara, CA (Hybrid)
Type: W-2 Contract
7+ years of experience in data engineering or related fields.
Strong hands-on experience with:
- AWS services: Glue, S3, Redshift, EMR, Lambda, Kinesis, Athena.
- Big Data tech: Spark/PySpark, Hadoop, Hive.
- Programming: Python, SQL, Scala (optional).
- Databases: SQL Server, PostgreSQL, MySQL, NoSQL (DynamoDB, MongoDB).
Experience with CI/CD, DevOps, and IaC tools.
Strong understanding of data modeling, warehousing, and distributed computing.
Data Pipeline & ETL Development
- Design, build, and maintain scalable ETL/ELT pipelines using AWS services (Glue, Lambda, EMR, Step Functions).
- Develop batch and real-time data ingestion processes from diverse sources (APIs, RDBMS, streaming platforms).
- Optimize data workflows for performance, scalability, and cost-efficiency.
Data Platform Engineering
- Architect and implement data lakes and data warehouses using S3, Redshift, Lake Formation, Athena.
- Manage data modeling (star/snowflake schemas) and design optimized storage layers.
- Implement data cataloging, metadata management, and data lifecycle policies.
Big Data & Analytics
- Work with big data tools such as Spark, Hadoop, Hive, and PySpark.
- Support analytics and machine learning teams by providing high-quality, curated datasets.
Cloud Infrastructure & DevOps
- Build CI/CD pipelines for data engineering (CodePipeline, CodeBuild, GitHub Actions).
- Write IaC using Terraform or AWS CloudFormation.
- Monitor, troubleshoot, and optimize workloads using CloudWatch and distributed logging.
Data Quality & Governance
- Implement data validation frameworks and automated quality checks.
- Ensure compliance with security, privacy, and governance standards (IAM, KMS, encryption).
We are an Equal Opportunity/Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, gender identity, sexual orientation, disability status, protected veteran status, or any other characteristic protected by law.