Big Data Engineer - Onsite - Rockville, MD or Tysons Corner, VA
Remote • Posted 2 days ago • Updated 17 minutes ago

FutureTech Consultants LLC
Dice Job Match Score™
📋 Comparing job requirements...
Job Details
Skills
- Big Data
- Python
- Amazon S3
- Agile
- AWS
- S3
- Spark
- hive
- SQL
- GLUE
- ETL
- hadoop
Summary
Title: Big Data Engineer (W2 only)
Duration: 6+ months
Work location: Rockville, MD or Tysons Corner, VA
Any Visa
Mode of Interview: 2 rounds of Video calls
Seeking a highly skilled and experienced Big Data Engineer to design, build, and optimize large-scale data platforms and distribute processing systems at our FinTech customer. This role is critical in enabling data-driven decision-making across the organization by delivering scalable, reliable, and high-performance data solutions.
The ideal candidate has deep expertise in distributed computing, cloud platforms, and modern big data technologies such as Apache Spark, Hadoop, Hive, and Trino. This individual will work closely with data scientists, analysts, product teams, and engineering stakeholders to architect and implement robust data pipelines and enterprise-grade data platforms. The role also requires strong software engineering practices, AI-assisted development proficiency, and the ability to optimize systems handling petabyte-scale data.
Responsibilities
Design, develop, and maintain large-scale data pipelines using modern big data technologies such as Spark, Hadoop, Hive, and Trino.
Build scalable and reliable solutions for data ingestion, transformation, storage, and analytics.
Architect distributed data platforms capable of processing massive (petabyte-scale) datasets.
Optimize and enhance existing data pipelines for performance, scalability, cost efficiency, and reliability.
Implement automated testing frameworks and continuous validation for data quality and pipeline accuracy.
Develop unit, integration, and end-to-end test strategies for data platforms.
Collaborate with cross-functional teams to translate business requirements into scalable data solutions.
Support data scientists and analytics teams by delivering high-quality, production-ready datasets.
Monitor, troubleshoot, and resolve data pipeline issues in production environments.
Investigate and resolve challenges such as data skew, resource constraints, job failures, and large-scale system bottlenecks.
Apply Spark tuning techniques including partitioning, caching, broadcast joins, and performance optimization.
Ensure strong software engineering practices, including version control, code quality, and CI/CD automation.
Stay current with emerging big data, cloud, and AI technologies to continuously improve data architecture.
Drive AI-enabled development practices, including prompt engineering, AI-assisted coding, and workflow optimization.
Partner with stakeholders to ensure regulatory, governance, and financial data integrity requirements are met.
Qualifications
Required:
Bachelor s degree in computer science, Information Systems, or a related discipline, or equivalent practical experience.
5+ years of experience designing and implementing big data and distributed systems.
Strong expertise in Apache Spark and its architecture (executors, stages, DAG, tasks).
Hands-on experience with big data technologies such as Hadoop, Hive, and Trino.
Strong proficiency in Python, Scala, or Java with a focus on scalable and modular code.
Extensive experience writing advanced SQL queries including window functions, complex joins, and aggregations.
Experience working with large-scale datasets and troubleshooting performance or scalability challenges.
Hands-on experience with cloud platforms such as AWS, including S3, EMR, Glue, Lambda, and Athena.
Experience designing and maintaining production ETL and data processing systems.
Strong understanding of distributed system performance tuning and resource optimization.
Experience implementing CI/CD pipelines and automated testing in data engineering environments.
Strong understanding of Agile methodologies such as Scrum and Kanban.
Excellent communication and collaboration skills.
Ability to work in fast-paced, dynamic environments and manage competing priorities.
- Dice Id: 10107680
- Position Id: 8888674
- Posted 2 days ago
Company Info
About FutureTech Consultants LLC
Company Overview
Founded in 1997, FutureTech is a leader in information technology (IT) and business process services with the expertise and capabilities to serve the needs of businesses of all sizes serving clients world-wide. FutureTech provides end-to-end services through our flexible and cost efficient delivery model that combines onsite and offsite operations. Rooted in quality processes and frameworks, our goal is to fully satisfy client objectives, serving as an accountable, flexible and objective partner.
FutureTech is headquartered in Norcross, Georgia and we have sales offices located in Chicago and Dallas. We also have development facilities in Hyderabad, India.
Services and Capabilities
From providing consulting and systems integration services to managing IT and business functions on behalf of our clients, FTC provides an all-encompassing offering that includes the planning, design, development, implementation and management of highly effective business and IT environments. Our global delivery capabilities include onsite and offshore options, giving our clients the flexibility to choose the model that best meets their needs.
Through our vertical expertise in financial, government, healthcare, telecommunications, utilities, manufacturing, distribution and retail services, we provide clients with a partner that is not only expert in IT, but expert in their industries. This combination of business knowledge and technology expertise allows us to help our clients adapt as their industries change.


Similar Jobs
It looks like there aren't any Similar Jobs for this job yet.
Search all similar jobs