apache spark Jobs in pennsylvania

Refine Results
281 - 300 of 330 Jobs

Azure Data Solution Architect

RNR IT Solutions, Inc.

Remote or Little Elm, Texas, USA

Full-time

Job Title: Data Warehouse & Azure Data Architect / Principal Data Engineer Location: Remote 15+ years of experience in database architecture, data engineering, and analytics solutionsStrong expertise in Azure Data Platform: Azure Data Factory (ADF), Azure Databricks, Synapse Analytics, Azure Data Lake Storage, Event Hub, Azure Analysis Services, Power BI, Microsoft FabricProficient in Python, Spark SQL, T-SQL, and metadata-driven developmentExperience designing and implementing Data Vault 2.0,

AWS Engineer

Axiom Global Technologies, Inc.

Remote

Contract

We're looking for a talented and experienced Senior AWS Engineer to join our team. In this role, you'll be instrumental in designing, developing, and implementing robust and scalable backend solutions, with a strong focus on data processing, API development, and AWS cloud infrastructure. If you're passionate about building high-performance systems and have a solid understanding of modern cloud architectures, we encourage you to apply! Key Responsibilities: Design and develop data-driven solutio

Resident Solution Architect

Cycle 3 IT Staffing

Remote

Contract

Cycle3 IT Staffing is seeking several Resident Solutions Architects who are SENIOR LEVEL for a REMOTE role. Architecting and Leadership: o Deep understanding of modern data architecture, including Lakehouse, Medallion architecture, and Data Mesh. Data Engineering: o Strong programming experience in Python; Scala is a significant plus. o Proficiency in complex query tuning. o Experience building Slowly Changing Dimensions (SCD) TYPEs. o Familiarity with DBT. o Experience with structured streaming

Technical Architect/Data Architect

MSquare Systems Inc.

Remote

Full-time

Key Responsibilities: Design scalable and secure end-to-end data pipeline architectures across cloud-native platforms (AWS, Azure) Build and demonstrate technical solutions using Databricks, Snowflake, Informatica, AWS Glue, etc. Integrate data validation and observability tools such as Great Expectations, Monte Carlo, or DataGaps Optimize pipeline performance, cost-efficiency, and data governance frameworks across diverse environments Collaborate with sales teams to understand client needs and

Machine Learning Artificial Intelligence Engineer with Big Data and Cloudera

Buxton Consulting

Remote

Contract

Machine Learning Artificial Intelligence Engineer with Big Data and Cloudera Location Fully Remote Must Haves- Strong project experience in Big Data platform with Amazon Web Services and Cloudera Data Platform is must. Strong experience with REST API development using Python frameworks (Django, Flask etc.) is must. Micro Services/Web service development experience using Spring framework is must. 4-5 years of programming experience in AWS, Linux and Data Science notebooks is must. Good project e

Databricks Engineer

Oscar Technology

Remote

Full-time

Job Title: Databricks Engineer - Remote Location: Remote Job Type: Full-Time Experience Level: Mid to Senior (5+ years) About the Role: We are seeking a skilled and experienced Databricks Engineer with a strong background in both Azure and AWS cloud platforms. As a key member of our Data Engineering team, you will design, develop, and optimize scalable data solutions using Apache Spark on Databricks, enabling advanced analytics and data-driven decision-making across the organization. This is

Databricks Engineer

Xyant Services, Inc.

Remote

Contract

Lead Databricks Engineer Role SummaryArchitecture & Engineering Leadership:Leads the end-to-end architecture, design, and implementation of big data pipelines on Databricks Lakehouse Platform, integrating with Azure, AWS, or Google Cloud Platform. Data Pipeline & ETL Expertise:Develops scalable, high-performance ETL/ELT pipelines using PySpark, Delta Lake, and SQL, enabling real-time and batch processing of large datasets. Team & Project Oversight:Mentors junior engineers, drives Agile delivery,

ML Ops Engineer (AWS GovCloud Databricks) - Remote - W2 only

Saksoft

Remote

Contract

Position: ML Ops Engineer (AWS GovCloud Databricks) Duration: 12+ months contract Location: Charlotte, NC (100% remote) JOB DESCRIPTION SKILLS NEEDED : Bachelor's degree in computer science, Engineering, Applied Mathematics or related field4 to 6 years of strong experience with AWS Gov Cloud environments, Export Control FedRAMP environmentsOverall 8 to 10 years of solid experience in the areas of data engineering, machine learning data science4 to 6 years of strong experience with the following

Program Manager

Tektend llc

Remote or New York, New York, USA

Contract, Third Party

Job Title: Program Manager Location: Remote Duration: 12+ Months Key Skills & Qualifications: Experience: 7+ years of experience in program management, with at least 3-5 years focused on digital transformation, cloud data platforms, and data analytics initiatives. Expertise in Databricks: Strong hands-on experience with Databricks platform, including but not limited to data engineering, machine learning workflows, and data science pipelines. Data Analytics: Solid understanding of data analytics

Quality Engineer

Echo IT Solutions, Inc.

Pittsburgh, Pennsylvania, USA

Contract

Job Title: Quality Engineer Job Type: Long Term Contract Location: Pittsburgh, PA (3 days Onsite in week)Job Description:6+ years exp as Quality Engineer coordinating with US-based and international personnel Required Skills: Big data Hadoop (hive, impala, spark, etc). Strong in Hadoop testing Python development background for writing Python scripts for automation Will be using their own automation tool (not QfaaS) No manual testing SQL HQL nice to have Heavy backend data testing No p

Full Stack Developer - REMOTE

Contract Staffing Recruiters

Remote

Contract

The Opportunity: We are seeking a full stack engineer/developer with experience in projects emphasizing big data and application migration. This person will be responsible for supporting our customers migration of hundreds of applications to Databricks including authentication, rewiring of connections, building new libraries, and adapting notebooks and code. Work with the Business Intelligence team and operational stakeholders to design and implement migration strategy of read only and transact

Data Engineer Microsoft Fabric

Kanini

Remote or Denver, Colorado, USA

Third Party, Contract

Job Title: Data Engineer Microsoft Fabric with Oracle Cloud Integration Location: Nashville TN Job Type: Long Term [Hybrid/ Onsite) Job Summary: We are looking for an experienced Data Engineer with strong expertise in Microsoft Fabric and proven skills integrating data from Oracle Cloud environments. The role involves designing and building scalable data pipelines, supporting data integration and modernization efforts, and ensuring smooth interoperability between Oracle Cloud and Azure-based an

Need !!! ML Ops Support Engineer @ Reading , PA

Srinav Inc.

Reading, Pennsylvania, USA

Third Party, Contract

Job Title: ML Ops Support EngineerLocation: Reading, PA (Day 1 onsite)Duration: Long-term Mandatory Skills:MLOps L2 Support Engineer to provide 24/7 production support for machine learning (ML) and data pipelines.The role requires on-call support, including weekends, to ensure high availability and reliability of ML workflows.The candidate will work with Dataiku, AWS, CI/CD pipelines, and containerized deployments to maintain and troubleshoot ML models in production. Key Responsibilities:Inciden

Data Pipeline Engineer

LowCountry Staffing & Consulting

Remote

Full-time

This position is 100% remote. No travel required. Design, develop and maintain manageable and scalable data pipelines and ETL processes to support data acquisition, integration and delivery for upstream applications, analytics, and other data-driven initiatives Design and implement advanced data storage solutions (databases, data warehouses, and data lakes) and efficient data transport layers (REST API s, message queues, message buses) Collaborate with executive leadership, product owners, tech

Platform Engineer / SRE

Javen Technologies, Inc

Remote

Contract

Platform Engineer Contract Location: Cincinnati OH- 100 % Remote Direct Client : Banking Industry Key Skills: Cloud Infrastructure Management (AWS, Google Cloud Platform, Digital Ocean)Automation & Orchestration (Ansible, Kubernetes, Docker)Software Development (Python, Go, PHP, or similar)System Monitoring & Performance OptimizationAPILinux administrationBanking Industry Job Summary: Responsible for building outstanding software solutions to drive the success of a business. Build various aspe

Data Engineer - Bridgeville, PA/ Homewood, AL/Irving, TX/ Brooklyn, OH - Onsite - W2 Only

Outcome Logix LLC

Pittsburgh, Pennsylvania, USA

Contract, Third Party

Key Responsibilities: Design, develop, and maintain scalable and robust data pipelines. Integrate data from multiple sources, including APIs, cloud platforms, databases, and third-party systems. Build and manage data infrastructure and architecture that supports analytics and reporting needs. Ensure data quality, consistency, reliability, and governance across systems. Collaborate with data scientists, analysts, and business stakeholders to understand data requirements. Implement data secu

Couchbase DBA - Spanish Native Speaker

Shimento, Inc.

Remote

Third Party, Contract

Couchbase DBA Location: - Mexico, California Contract Role Spanish language speaker Couchbase certification is a plus Job Description We are looking for an experienced Senior Database Administrator (DBA) with 9+ years of expertise in designing, managing, and optimizing large-scale, high-performance databases across cloud and on-premises environments. The ideal candidate will have deep DBA expertise in both SQL and NoSQL databases, with hands-on experience in Couchbase, PostgreSQL, MySQL, Oracle,

MLOps L2 Support Engineer (On-Call & Weekend Support) - Reading, PA (Onsite)

Kaizen Technologies

Pennsylvania, USA

Contract, Third Party

Greetings for the day, EST Time Zone and Locals Prefered Role: MLOps L2 Support Engineer (On-Call & Weekend Support) Location: Reading, PA (Onsite) Job Summary: MLOps L2 Support Engineer to provide 24/7 production support for machine learning (ML) and data pipelines. The role requires on-call support, including weekends, to ensure high availability and reliability of ML workflows. The candidate will work with Dataiku, AWS, CI/CD pipelines, and containerized deployments to maintain and t

Google Cloud Platform Data Engineer

Acadia Technologies, Inc.

Philadelphia, Pennsylvania, USA

Full-time

Google Cloud Platform Cloud Platform Expertise:Core Services:A deep understanding of Google Cloud services like BigQuery, Cloud Storage, Cloud Dataflow, Cloud Pub/Sub, Cloud Composer (Airflow), and Cloud Dataproc is crucial. Data Warehousing:Proficiency in building and optimizing data warehouses using BigQuery for analytics and data storage. ETL Processes:Experience with Extract, Transform, Load (ETL) processes to move data between different sources, using tools like Cloud Dataflow, Apache Bea

Microservices engineer- API / Security

HCL America Inc.

Remote

Full-time

We are HCLTech, one of the fastest-growing large tech companies in the world and home to 218,000 people across 60 countries, supercharging progress through industry-leading capabilities centered around Digital, Engineering and Cloud. ( ) The driving force behind that work, our people, are diverse, creative, and passionate, raising the bar for excellence on a regular basis. We, in turn, work hard to bring out the best in them as we strive to help them find their spark and become the best version