Senior Data Processing Platform Engineer

Overview

On Site
USD 148,000.00 per year
Full Time

Skills

GPU
High Availability
Data Science
Distributed Computing
KPI
Dashboard
Real-time
Workflow
Continuous Improvement
Kubernetes
Microservices
Performance Tuning
Database
Big Data
Apache Spark
Software Development
Computer Science
Electrical Engineering
Customer Engagement
Software Engineering
Mathematics
Data Processing
CUDA
Machine Learning (ML)
Recruiting
Promotions
SAP BASIS
Law

Job Details

Our technology has no boundaries! NVIDIA is creating the world's most groundbreaking and innovative compute platforms for the world to use! It's because of our work that data engineers and data scientists can advance their ideas. We are looking for individuals to develop a data processing and ML platform for data scientists to use.

As a data processing platform engineer, you will design, implement and operate Kubernetes based GPU accelerated data processing service at scale, with high availability and reliability. You will lead and encourage adoption of the data processing service, your work should improve time to first query (TTFQ) metrics, drive platform engagement metrics, and come up with innovative solutions that blends with pioneering Nvidia's enterprise scale data science platform.

What you'll be doing:
  • Optimize distributed computing infrastructure by analyzing cost and right sizing for latency and performance
  • Identify KPIs and instrument metrics tracking for dashboards and alerting
  • Enhancing and maintaining a robust scale, reliable, real-time data processing service
  • Train data engineers, data scientists and production engineers how to adopt data processing workflows
  • Participate in on-call rotation, run-book implementation and continuous process improvement

What we need to see:
  • Experience architecting, developing and deploying large-scale distributed systems at scale
  • Strong Kubernetes experience on-premise and/or CSP, developing containerized microservices
  • Hands on experience in the performance tuning/troubleshooting Spark applications
  • Experience with distributed systems, databases, and big data systems like Ray, Spark Rapids
  • Familiarity with metrics collection, health monitoring, and observability tools
  • Building, operating and maintaining full stack software deployments coupled with excellent software programming skills
  • Master's or Bachelor's degree in Computer Science or Electrical Engineering or CE or equivalent experience.
  • A minimum of 5yrs experience with a background in software engineering and math

Ways to stand out from the crowd:
  • Prior data processing at scale on Nvidia GPUs
  • Experience with CUDA and/or using Nvidia GPUs for ML/DL

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.