Java Principal Engineer

Overview

Remote
Depends on Experience
Full Time
No Travel Required

Skills

Java17
SQL
Apache Kafka
Payments
Testing
Spring Framework
Amazon Web Services
Communication
Scalability
Real-time

Job Details

Overview:
We are building a next-generation embedded payments platform designed for real-time, high-throughput transaction processing. The system currently processes 110 payments per second, with a target of 500 per second by the end of January. We re seeking Principal Engineers who can drive architectural decisions, ensure system scalability, and lead complex engineering initiatives across distributed systems.

Responsibilities:

  • Lead the design and architecture of distributed, fault-tolerant, and highly available systems.
  • Optimize Java-based systems for concurrency, performance, and memory efficiency.
  • Guide engineering teams in adopting best practices for system design, observability, and reliability.
  • Drive improvements in real-time processing pipelines using event-driven technologies such as Kafka.
  • Mentor staff-level engineers and provide technical leadership across the team.
  • Collaborate with product owners and stakeholders to translate requirements into scalable solutions.
  • Ensure system resilience through failover, degradation handling, and SLA adherence.

Must-Have Skills:

  • Java 17 Expert-level, with deep knowledge of multithreading, concurrency, and performance tuning.
  • Spring Boot Proven experience building enterprise-grade microservices.
  • Kafka (or equivalent) Strong experience with event-driven architecture and stream processing.
  • AWS (ECS, RDS, ElastiCache/Redis) Hands-on deployment and optimization experience.
  • Terraform Ability to design infrastructure-as-code for production systems.
  • SQL Strong relational database design and performance optimization.
  • Distributed Systems Proven track record designing high-scale, fault-tolerant systems.
  • Testing Strong unit, integration, and end-to-end testing discipline.
  • Observability Experience with OpenTelemetry, Grafana, Splunk or similar tools.

Nice-to-Have Skills:

  • Expertise in availability engineering and reliability frameworks.
  • Experience creating monitoring dashboards and distributed tracing setups.
  • Strong ability to drive system design independently with minimal guidance.

Soft Skills:

  • Ability to lead cross-functional technical discussions and influence architectural decisions.
  • Excellent communication skills with both technical and non-technical stakeholders.
  • Strong mentoring and coaching abilities.
  • Comfortable working in Kanban within a SAFe Agile environment.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.