Big Data Dev/Spark Scala Engineer

O'Fallon, MO, US • Posted 9 hours ago • Updated 9 hours ago
Full Time
No Travel Required
On-site
$80,000 - $110,000/yr

Job Details

Skills

  • Big Data
  • Scala
  • Spark
  • PySpark
  • Python
  • Apache NiFi
  • Apache Kafka
  • Ceph

Summary

Job Title: Big Data Dev/Spark Scala Engineer
Location: O'Fallon, MO
Job Type: Full-time
 
Job Description:

Must Have Technical/Functional Skills
  • Experience with Apache Ozone and/or Ceph as storage backends for analytics workloads
  • Experience implementing exactly-once / at-least-once streaming semantics
  • Strong background in Spark performance tuning (CPU, memory, I/O, shuffle)
  • Experience supporting mission-critical production systems with strict SLAs
  • Familiarity with CI/CD pipelines and automated testing for data applications
  • Experience designing observability for streaming systems (lag, throughput, backpressure)
Technical Skills
  • Languages: Scala, Python (PySpark), SQL
  • Big Data: Apache Spark (Core, SQL, Structured Streaming)
  • Streaming: Kafka
  • Ingestion / Orchestration: Apache NiFi
  • Storage: Apache Ozone, Ceph, object storage concepts
  • OS & Tooling: Linux, Git, CI/CD, monitoring and logging tools
Roles & Responsibilities

Key Responsibilities

  • Design, develop, and maintain large-scale Spark applications using Scala and PySpark
  • Build and operate streaming-heavy data pipelines using Kafka and Spark Structured Streaming
  • Implement stateful streaming patterns including windowing, watermarking, late data handling, and checkpointing
  • Develop robust event replay and reprocessing workflows using Kafka offsets and partitions
  • Build ingestion and routing flows using Apache NiFi, including Kafka-based ingestion patterns
  • Implement end-to-end ETL/ELT pipelines with strong emphasis on low latency, fault tolerance, and scalability
  • Optimize Spark jobs through partitioning strategies, memory tuning, shuffle optimization, and efficient data formats
  • Integrate Spark workloads with distributed object storage systems such as Apache Ozone and Ceph
  • Ensure data quality, consistency, and auditability through validation, reconciliation, and metadata capture
  • Collaborate with platform, infrastructure, and operations teams on production readiness and capacity planning
  • Support production systems, including monitoring, incident analysis, and root cause resolution
  • Contribute to reusable frameworks, coding standards, and engineering best practices
  • Participate in architecture reviews, code reviews, and technical documentation
Required Qualifications
  • Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience
  • Strong hands-on experience with Apache Spark in production environments
  • Advanced proficiency in Scala and PySpark
  • Solid understanding of distributed systems and data processing at scale
  • Strong experience with Kafka-based streaming architectures
  • Hands-on experience with Spark Structured Streaming
  • Experience building batch and real-time pipelines
  • Hands-on experience with Apache NiFi for data ingestion and flow management
  • Strong SQL skills and experience working with structured and semi-structured data
  • Experience working with object storage or distributed storage platforms
  • Proficiency with Linux, shell scripting, and Git-based version control
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 91097129
  • Position Id: 8971384

Company Info

About Stanley David and Associates

We strive to add value and work as a true partner with our clients.

Stanley David and Associates is a recruitment specialist in IT and Engineering, and we stay firmly in our area of expertise, doing what we love.

We know the players and the companies and invest a lot of time getting to know candidates and clients in equal measure. This ensures a swift, cost-effective, and perfect placement, whether it's permanent or interim.

In addition, we have a reputation for having the best understanding of the market landscape and for sourcing great candidates.

-We have a global footprint with offices in 3 countries: USA, UK, and India.

-SDNA Global has built up an incredible reputation within IT strategic hiring.

-We work with Tier 1 and Tier 2 IT outsourcing companies for leadership hiring needs in the UK, Europe, USA, and India.

-Each SDNA member has over 5 years of experience in Talent Acquisition

-We have successfully closed roles in the UK, USA, Germany, Sweden, Dubai, France, Netherlands, Switzerland, Austria, Hungary, Spain, Italy, Norway, Denmark, Nigeria, and South Africa

-Telecom, Media and Hi-tech

-Health care and Life Sciences

-Energy and Utilities

-CPG, Retail and Transport

-Banking and Financial Services

