Senior/Software Engineer - Data Analytics

• Posted 15 hours ago • Updated 2 hours ago
Full Time
Fitment

Job Details

Skills

  • Streaming
  • Cloud Computing
  • Programming Languages
  • Python
  • Information Retrieval
  • Analytical Skill
  • JDBC
  • Amazon Web Services
  • Data Processing
  • Analytics
  • Use Cases
  • LlamaIndex
  • Decision Support
  • Workflow
  • Scalability
  • Agile
  • Scrum
  • Sprint
  • User Stories
  • Management
  • Data Analysis
  • JIRA
  • Bug Tracking
  • Design Review
  • Code Review
  • Java
  • Backend Development
  • Computer Science
  • Data Structure
  • Algorithms
  • Software Engineering
  • Software Development
  • Scala
  • Apache Spark
  • Apache Kafka
  • Apache Hadoop
  • Snow Flake Schema
  • Generative Artificial Intelligence (AI)
  • LangChain
  • Orchestration
  • Prompt Engineering
  • Artificial Intelligence
  • Debugging
  • Big Data
  • Real-time
  • Data Warehouse
  • Communication
  • Collaboration
  • Innovation
  • Productivity
  • Health Insurance
  • Broadband
  • SAP BASIS
  • Media
  • Internet
  • Digital Marketing
  • Advertising

Summary

About the Role

PubMatic is seeking Senior / Software Engineers focused on data analytics, with expertise in building and optimizing AI agents. Candidates should have strong skills in Hadoop, Spark, Spark Streaming, Kafka, and cloud-based solutions, along with proficiency in Scala and Python. You will develop advanced AI agents that enhance our data analytics capabilities, enabling our platform to handle complex information retrieval, contextual understanding, and adaptive interactions, and ultimately improving our data-driven insights and analytical workflows.

What You'll Do:
  • Build, design, and implement our highly scalable, fault-tolerant big data platform to process terabytes of data and provide customers with in-depth analytics.
  • Develop backend services using Java, REST APIs, JDBC, and AWS.
  • Build and maintain Big Data pipelines using technologies like Spark, Hadoop, Kafka, and Snowflake.
  • Architect and implement real-time data processing workflows and automation frameworks.
  • Design and develop GenAI-powered agents for analytics, operations, and data enrichment use cases using frameworks like LangChain, LlamaIndex, or custom orchestration systems.
  • Integrate LLMs (e.g., OpenAI, Claude, Mistral) into existing services for query understanding, summarization, and decision support.
  • Manage end-to-end GenAI workflows including prompt engineering, fine-tuning, vector embeddings, and retrieval-augmented generation (RAG).
  • Work closely with cross-functional teams to improve the availability and scalability of large data platforms and the functionality of PubMatic software.
  • Participate in Agile/Scrum processes such as sprint planning, sprint retrospective, backlog grooming, user story management, and work item prioritization.
  • Meet regularly with product managers to discuss which software features to include in the PubMatic Data Analytics platform.
  • Support customer issues over email or JIRA (bug tracking system), and provide customers with updates and patches that fix those issues.
  • Perform code and design reviews of peers' work, as required by the code review process.

We'd Love for You to Have:
  • 1-5+ years of coding experience in Java and backend development.
  • Solid computer science fundamentals, including data structure and algorithm design, and creation of architectural specifications.
  • Expertise in applying professional software engineering best practices across the full software development life cycle, including coding standards and code reviews.
  • Hands-on experience with Big Data tools and systems such as Spark (with Scala), Kafka, Hadoop, and Snowflake.
  • Proven expertise in building GenAI applications, including:
    LLM integration (OpenAI, Anthropic, Cohere, etc.)
    LangChain or similar agent orchestration libraries
    Prompt engineering, embeddings, and retrieval-augmented generation (RAG)
  • Experience in developing and deploying scalable, production-grade AI or data systems.
  • Ability to lead end-to-end feature development and debug distributed systems.
  • Experience developing and delivering large-scale big data pipelines, real-time systems, and data warehouses is preferred.
  • Demonstrated ability to achieve stretch goals in a very innovative and fast-paced environment.
  • Demonstrated ability to learn new technologies quickly and independently.
  • Excellent verbal and written communication skills, especially in technical communications.
  • Strong interpersonal skills and a desire to work collaboratively.

Qualification:

Candidates should have a bachelor's degree in engineering, or an equivalent degree, from a well-known institute/university.

Additional Information:

Return to Office: PubMatic employees throughout the globe have returned to our offices via a hybrid work schedule (3 days "in office" and 2 days "working remotely") that is intended to maximize collaboration, innovation, and productivity among teams and across functions.

Benefits: Our benefits package includes the best of what leading organizations provide, such as paternity/maternity leave, healthcare insurance, and broadband reimbursement. In addition, when we're back in the office, we all benefit from a kitchen loaded with healthy snacks and drinks, catered lunches, and much more!

Diversity and Inclusion: PubMatic is proud to be an equal opportunity employer; we don't just value diversity, we promote and celebrate it. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

About PubMatic

PubMatic is one of the world's leading scaled digital advertising platforms, offering more transparent advertising solutions to publishers, media buyers, commerce companies and data owners, allowing them to harness the power and potential of the open internet to drive better business outcomes.

Founded in 2006 with the vision that data-driven decisioning would be the future of digital advertising, we enable content creators to run a more profitable advertising business, which in turn allows them to invest back into the multi-screen and multi-format content that consumers demand.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: RTX16960c
  • Position Id: cb0de115f01d007cbca5fd5c9919ce5f