Apache Spark Jobs in Pennsylvania

1 - 20 of 320 Jobs

Integration Lead (IICS, Spark, Azure Data Lake)

Ztek Consulting

Remote

Full-time

Note: This is a full-time, remote opportunity. Role: Integration Lead (IICS, Spark, Azure Data Lake). Work location: Remote. Job Description: 10+ years of experience in Informatica ETL Design and Architecture, Data Analysis & Data Management. Demonstrated expertise in designing and implementing data architecture solutions using Informatica Cloud and Azure. Implemented an Informatica-based ETL solution fulfilling stringent performance requirements. Experience in designing and implementing ETL processe

Spark Architect

DCE Infosec LLC

Remote

Full-time

No Employer. Must have 15 to 20+ years of experience. Mandatory Skills - Spark, Scala, Kafka, Streaming, technical architecture, Databricks, Data Engineering, data quality, Data Governance. Job Description: Hands-on with Spark (dataframe), Spark SQL, Databricks, AWS Glue, Scala/Spark or PySpark, Kafka or another streaming technology. Good learning & cross skilling ability. Design & Architecture of big data systems. Experience in ETL, Data Governance, Data Quality, Good understanding of data oper

Senior Java Developer with Spark - VP - TAMPA

Citi

Remote or Tampa, Florida, USA

Full-time

The Java Applications Developer with Spark is a senior level position responsible for establishing and implementing new or revised application systems and programs in coordination with the Technology team. The overall objective of this role is to lead applications systems analysis and programming activities. Responsibilities: Lead integration of functions to meet goals, deploy new products, and enhance processes. Analyze complex business processes, system processes, and industry standards to defi

Big Data/Spark Principal Architect

AIT Global, Inc.

US

Full-time

Job Title: Big Data/Spark Principal Architect. Location: Remote. Mandatory Skills - Spark, Scala, Kafka, Streaming, technical architecture, Databricks, Data Engineering, data quality, Data Governance. Years of Experience: 15 to 20 Years. Job Description: Hands-on with Spark (data frame), Spark SQL, Databricks, AWS Glue, Scala/Spark or PySpark, Kafka or another streaming technology. Good learning & cross skilling ability. Design & Architecture of big data systems. Experience in ETL, Data Governance,

Data Engineer (SPARK, EMR, REDSHIFT, AWS) - 100% REMOTE

Provish Consulting

Remote

Full-time

Job Title: Data Engineer (SPARK, EMR, REDSHIFT, AWS) Job Location: 100% REMOTE (Candidate needs to work in PST time zone) Job Type: 6+ Months Role Summary: We are seeking a highly skilled Data Engineer to join our client, focusing on developing and maintaining databases, driving efficient data analysis, and providing analytical support for cross-functional teams. The ideal candidate should be experienced with EMR, SPARK, REDSHIFT and AWS Cloud, and have a background in data privacy. Preferred candid

Senior Data Engineer

Apton Inc

Remote

Contract, Third Party

8-12+ years of proven experience in Data Engineering, worked on designing and developing software with Big Data, Data Lake/ Lake House ecosystem, Data Analytics, backend microservices architecture, and heterogeneous data types at scale Proven in-depth experience in creating ELT/integrations pipelines using Databricks, Spark, Python, SQL, Scala, Kafka, Presto, Parquet, Streaming, events, bots, AWS/cloud ecosystem. Proficient in developing Micro Services and using AWS frameworks such as SQS, Strea

AWS Databricks Architect

Chabez Tech LLC

Remote

Third Party, Contract

Position: AWS Databricks Architect Experience: 15+ years Location: Remote Job Description: We are seeking a highly experienced AWS Databricks Architect to lead the architecture and design of scalable data solutions within the AWS cloud ecosystem for ThermoFisher. The ideal candidate will bring deep expertise in cloud-native data processing, analytics, and architecture, along with strong leadership capabilities to drive large-scale projects. Responsibilities: Design and implement end-to-end cloud d

Google Cloud Platform Data Architect / Engineer (Must work on W2)

Activesoft, Inc.

Remote

Contract

Required: Deeply experienced data engineer/architect. Review the ascend.io installation on Google Cloud Platform, assess it, and give recommendations on best practices. Kubernetes, Spark, etc., are used by Ascend.io.

Senior General Data Architect

Amiseq Inc.

Remote

Contract

Job Description: * 15+ years of IT experience, with a minimum of 7 years of experience in data engineering, data platform, and analytics. * Projects delivered with hands-on experience in development on Databricks. * Working knowledge of any one cloud platform (AWS, Azure, or Google Cloud Platform). * Deep experience with distributed computing with Spark, including knowledge of Spark runtime internals. * Familiarity with CI/CD for production deployments. * Working knowledge of MLOps. * Current knowl

Fullstack Python Developer

Cosmic-I LLC DBA Northern Base

Remote

Full-time

Job Title: Fullstack Python Developer Location: 100% Remote Type: Full-time Experience in implementing server-side technologies with RESTful APIs and MVC/MVT design patterns using Django/Flask frameworks. Experienced in MVW frameworks like Django and jQuery. Experienced in developing web-based applications using Python, Django, PHP, XML, CSS5, HTML, DHTML, JavaScript, RabbitMQ, Jdk1.7. Excellent programming skills at a higher level of abstraction using Scala and Spark. Experience in project

ML Data Infrastructure Engineer

DVARN

Remote

Contract

Role: ML Data Infrastructure Engineer Location: Sunnyvale, CA OPEN FOR 100% REMOTE Duration: 12 Months Candidates should have strong experience in Deep Learning ML frameworks and model serving technologies, which is lacking in most of the resumes. JD: 8+ years of software engineering experience, with 3+ years in ML serving/infrastructure. Strong expertise in container orchestration (Kubernetes) and cloud platforms. Experience with model serving technologies (TensorFlow Serving, Triton, KServe). Deep

Sr. Big Data Developer - W2 Resource Only

Nasscomm, Inc.

Remote

Contract

Big Data Developer - Minimum 6 yrs of experience in Cloud, Big Data, Spark & Scala - Must-have skill: Experience in Spark and Scala with exposure to cloud platforms (AWS, Azure, Google Cloud Platform) for big data processing and storage - Strong experience in Azure DLS - Strong experience in Databricks, data pipelines - Experience in Hadoop - Seeking someone with strong backend development expertise, particularly in Java (Spring framework) - Agile delivery experience

Senior Data Engineer with Healthcare Domain

E-Solutions, Inc.

California, USA

Full-time

Hi Professionals, Title: AWS Data Engineer with Healthcare Domain Location: Remote Duration: Full-time JD: Mandatory Skills: AWS Databricks, PySpark or Spark, Data Pipeline, ETL, SQL, Data Warehouse "Disclaimer: E-Solutions Inc. provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, disability, genetic information, marital status, amnesty,

Lead Healthcare Data Scientist with Azure

VLink Inc

Remote

Contract

Job Title: Lead Healthcare Data Scientist with Azure Location: Remote (EST Only) Employment Type: Contract Duration: 6-month initial contract About VLink: Started in 2006 and headquartered in Connecticut, VLink is one of the fastest growing digital technology services and consulting companies. Since its inception, our innovative team members have been solving the most complex business and IT challenges of our global clients. Job Description: Lead Healthcare Data Scientist Key Skills: Healthcar

Principal ML OPS Engineer

ProHire Solution

Remote

Full-time

Job Location: 100% Remote (USA) Job Summary: We are looking for a seasoned Principal ML OPS Engineer to architect, build, and optimize an ML inference platform. The role demands an individual with significant expertise in Machine Learning engineering and infrastructure, with an emphasis on building Machine Learning inference systems. Proven experience in building and scaling ML inference platforms in a production environment is crucial. This remote position calls for exceptional communication skil

Data Engineer (Middle) ID35916

Agile Engine

Remote

Full-time

What you will do: Build and support ETL pipelines; Monitor data pipelines, identify bottlenecks, optimize data processing and storage for performance and cost-effectiveness; Collaborate effectively with cross-functional teams including data scientists, analysts, software engineers, and business stakeholders; Work with Terraform to build AWS infrastructure; Analyze sources and build Cloud Data Warehouse and Data Lake solution. Must haves: 3+ years of professional experience with Python; 3+ years of pr

VDOT - Agentic Data Engineer

Cyber Resource Provider LLC

Remote or Richmond, Virginia, USA

Full-time, Part-time, Contract, Third Party

The Virginia Department of Transportation's Information Technology Division is seeking a highly skilled Agentic Data Engineer to design, develop, and deploy data pipelines that leverage agentic AI to solve real-world problems. The ideal candidate will have experience in designing data processes to support agentic systems, ensuring data quality, and facilitating interaction bet

Data Engineer III

Kforce Technology Staffing

Remote or Mountain View, California, USA

Contract

RESPONSIBILITIES: Kforce has a client in Mountain View, CA that is seeking a Data Engineer. Responsibilities: * Developing/refactoring/optimizing real-time data processing applications using Apache Flink * Developing/refactoring/optimizing batch data processing applications using Apache Spark * Migrating real-time data pipelines to isolated Kafka infrastructure and coordinating migration efforts with consumers * Operational monitoring and on-call support for business-critical applications *

MLOps Engineer

SkilzMatrix Digital

Pittsburgh, Pennsylvania, USA

Full-time

Technical Skills: o Amazon SageMaker: In-depth knowledge of SageMaker, including domain setup, configuration, and infrastructure management. o Cloud Knowledge: A deep understanding of cloud computing concepts, especially related to Amazon Web Services (AWS). o Infrastructure Design: Ability to design and implement MLOps cloud solutions, considering scalability, security, and performance. o Experience: Practical firsthand experience with cloud MLOps and Data Analytics platforms, preferably AWS Sag

Data Engineer

Raas Infotek LLC

Remote

Contract

Position: Data Engineer Type: W2 Contract We're looking for a Data Engineer to design, build, and optimize data pipelines that support enterprise analytics and digital transformation initiatives. This role is ideal for someone who thrives in a fast-paced, data-driven environment and has a strong background in modern data engineering tools and cloud platforms. Responsibilities: Design, develop, and maintain scalable ETL/ELT pipelines using modern data frameworks. Collaborate with data analysts, ar