Role - Big Data Developer / Spark Scala Engineer
Experience Required - 7+ Years
Must-Have Technical/Functional Skills
• Experience with Apache Ozone and/or Ceph as storage backends for analytics workloads
• Experience implementing exactly-once / at-least-once streaming semantics
• Strong background in Spark performance tuning (CPU, memory, I/O, shuffle)
• Experience supporting mission-critical production systems with strict SLAs
• Familiarity with CI/CD pipelines and automated testing for data applications
• Experience designing observability for streaming systems (lag, throughput, backpressure)
Technical Skills
• Languages: Scala, Python (PySpark), SQL
• Big Data: Apache Spark (Core, SQL, Structured Streaming)
• Streaming: Kafka
• Ingestion / Orchestration: Apache NiFi
• Storage: Apache Ozone, Ceph, object storage concepts
• OS & Tooling: Linux, Git, CI/CD, monitoring and logging tools
Roles & Responsibilities
• Design, develop, and maintain large-scale Spark applications using Scala and PySpark
• Build and operate streaming-heavy data pipelines using Kafka and Spark Structured Streaming
• Implement stateful streaming patterns including windowing, watermarking, late-data handling, and checkpointing
• Develop robust event replay and reprocessing workflows using Kafka offsets and partitions
• Build ingestion and routing flows using Apache NiFi, including Kafka-based ingestion patterns
• Implement end-to-end ETL/ELT pipelines with a strong emphasis on low latency, fault tolerance, and scalability
• Optimize Spark jobs through partitioning strategies, memory tuning, shuffle optimization, and efficient data formats
• Integrate Spark workloads with distributed object storage systems such as Apache Ozone and Ceph
• Ensure data quality, consistency, and auditability through validation, reconciliation, and metadata capture
• Collaborate with platform, infrastructure, and operations teams on production readiness and capacity planning
• Support production systems, including monitoring, incident analysis, and root cause resolution
• Contribute to reusable frameworks, coding standards, and engineering best practices
• Participate in architecture reviews, code reviews, and technical documentation
Required Qualifications
• Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience
• Strong hands-on experience with Apache Spark in production environments
• Advanced proficiency in Scala and PySpark
• Solid understanding of distributed systems and data processing at scale
• Strong experience with Kafka-based streaming architectures
• Hands-on experience with Spark Structured Streaming
• Experience building batch and real-time pipelines
• Hands-on experience with Apache NiFi for data ingestion and flow management
• Strong SQL skills and experience working with structured and semi-structured data
• Experience working with object storage or distributed storage platforms
• Proficiency with Linux, shell scripting, and Git-based version control