Principal Software Engineer - Data Analytics

• Posted 17 hours ago • Updated 4 hours ago
Full Time
Fitment

Dice Job Match Score™

👤 Reviewing your profile...

Job Details

Skills

  • Streaming
  • Cloud Computing
  • Programming Languages
  • Python
  • Information Retrieval
  • Analytical Skill
  • JDBC
  • Amazon Web Services
  • Data Processing
  • Reporting
  • Analytics
  • Use Cases
  • LlamaIndex
  • Decision Support
  • Workflow
  • Scalability
  • Agile
  • Scrum
  • Sprint
  • User Stories
  • Management
  • Data Analysis
  • JIRA
  • Bug Tracking
  • Design Review
  • Code Review
  • Java
  • Backend Development
  • Computer Science
  • Data Structure
  • Algorithms
  • Software Engineering
  • Software Development
  • Scala
  • Apache Spark
  • Apache Kafka
  • Apache Hadoop
  • Snow Flake Schema
  • Generative Artificial Intelligence (AI)
  • LangChain
  • Orchestration
  • Prompt Engineering
  • Artificial Intelligence
  • Debugging
  • Big Data
  • Real-time
  • Data Warehouse
  • Communication
  • Collaboration
  • Innovation
  • Productivity
  • Health Insurance
  • Broadband
  • SAP BASIS
  • Media
  • Internet
  • Digital Marketing
  • Advertising

Summary

About the Role

PubMatic is seeking a Data Analytics-focused Principal Software Engineer with experience in building and optimizing AI agents, including strong skills in Hadoop, Spark, Scala, Kafka, Spark Streaming, and cloud-based solutions, with proficiency in programming languages such as Scala and Python. You will be responsible for developing advanced
AI agents that enhance our data analytics capabilities, enabling our platform to handle complex information retrieval, contextual understanding, and adaptive interactions, ultimately improving our data-driven insights and analytical workflows.

What You'll Do:
  • Build, design, and implement our highly scalable, fault-tolerant big data platform to process terabytes of data and provide customers with in-depth analytics.
  • Develop backend services using Java, REST APIs, JDBC, and AWS.
  • Build and maintain Big Data pipelines using technologies like Spark, Hadoop, Kafka, and Snowflake.
  • Architect and implement real-time data processing workflows and automation frameworks.
  • Lead multiple projects to develop features for data processing and reporting platforms, and collaborate with product managers and cross-functional teams.
  • Collaborate with functional teams to build products to deliver end-to-end products and features, and fix bugs for better performance.
  • Design and develop GenAI-powered agents for analytics, operations, and data enrichment use cases using frameworks like LangChain, LlamaIndex, or custom orchestration systems.
  • Integrate LLMs (e.g., OpenAI, Claude, Mistral) into existing services for query understanding, summarization, and decision support.
  • Manage end-to-end GenAI workflows, including prompt engineering, fine-tuning, vector embeddings, and retrieval-augmented generation (RAG).
  • Working closely with cross-functional teams on improving the availability and scalability of large data platforms and the functionality of PubMatic software.
  • Participate in Agile/Scrum processes such as sprint planning, sprint retrospective, backlog grooming, user story management, and work item prioritization.
  • Frequently discuss with product managers about the software features to include in the PubMatic Data Analytics platform.
  • Support customer issues over email or JIRA (bug tracking system), provide updates, and patches to customers to fix the issues.
  • Perform code and design reviews for code implemented by peers or as per the code review process.

We'd Love for You to Have:
  • 6+ years of coding experience in Java and backend development.
  • Solid computer science fundamentals, including data structure and algorithm design, and creation of architectural specifications.
  • Experience in developing the implementation of professional software engineering best practices for the full software development life cycle, including coding standards and code reviews.
  • Hands-on experience with Big Data tools and systems like Scala Spark, Kafka, Hadoop, and Snowflake.
  • Proven experience in building GenAI applications, including:
    o LLM integration (OpenAI, Anthropic, Cohere, etc.)
    o LangChain, or similar agent orchestration libraries
    o Prompt engineering, embedding, and retrieval-based generation (RAG)
  • Experience in developing and deploying scalable, production-grade AI or data systems.
  • Ability to lead end-to-end feature development and debug distributed systems.
  • Experience in developing and delivering large-scale big data pipelines, real-time systems & data warehouses would be preferred.
  • Demonstrated ability to achieve stretch goals in a very innovative and fast-paced environment.
  • Demonstrated ability to learn new technologies quickly and independently.
  • Excellent verbal and written communication skills, especially in technical communications.
  • Strong interpersonal skills and a desire to work collaboratively.

Qualification:

Should have a bachelor's degree in engineering or an equivalent degree from a well-known institute/university.

Additional Information:

Return to Office: PubMatic employees throughout the globe have returned to our offices via a hybrid work schedule (3 days "in office" and 2 days "working remotely") that is intended to maximize collaboration, innovation, and productivity among teams and across functions.

Benefits: Our benefits package includes the best of what leading organizations provide, such as paternity/maternity leave, healthcare insurance, broadband reimbursement. As well, when we're back in the office, we all benefit from a kitchen loaded with healthy snacks and drinks and catered lunches and much more!.

Diversity and Inclusion: PubMatic is proud to be an equal opportunity employer; we don't just value diversity, we promote and celebrate it. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status

About PubMatic

PubMatic is one of the world's leading scaled digital advertising platforms, offering more transparent advertising solutions to publishers, media buyers, commerce companies and data owners, allowing them to harness the power and potential of the open internet to drive better business outcomes.

Founded in 2006 with the vision that data-driven decisioning would be the future of digital advertising, we enable content creators to run a more profitable advertising business, which in turn allows them to invest back into the multi-screen and multi-format content that consumers demand.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: RTX16960c
  • Position Id: ebdd61150708db63c0be3d0b7d0539ea
  • Posted 17 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

No location provided

Today

Full-time

Remote or Bethesda, Maryland

Today

Full-time

USD 163,800.00 - 257,400.00 per year

Bellevue, Washington

Today

Full-time

USD 173,100.00 - 276,800.00 per year

Austin, Texas

Today

Full-time

USD 110,700.00 - 171,800.00 per year

Search all similar jobs