Job Title: Telecom System Architect Location: Basking Ridge, New Jersey (On-Site)
Job Description:
Design and lead scalable, fault-tolerant mediation architectures, including file ingestion, format conversion, SFTP distribution, event monitoring, and downstream delivery pipelines.
Build and maintain core mediation services using Java 17 and Spring Boot, including Spring Integration, Spring Batch, and Spring Kafka.
Contribute to shared starters and reusable platform libraries to standardize and accelerate development.
Parse, decode, and transform binary telecom data formats, including ASN.1 encoded CDRs/UDRs, AMA records, and proprietary carrier formats at the byte-stream level.
Design, deploy, and operate containerized workloads on Kubernetes and OpenShift.
Own Helm chart definitions, resource tuning, PVC strategies, horizontal scaling, and rolling upgrade policies.
Design and optimize Oracle database schemas for high-volume mediation metadata, reconciliation state, and audit trails.
Write, tune, and analyze complex SQL and PL/SQL for operational reporting and performance optimization.
Define and enforce SLOs and SLIs, lead incident response, drive post-mortems, and implement error budgets and auto-remediation patterns.
Build production-grade observability using structured logging, distributed tracing, metrics, and alerting.
Implement and maintain monitoring stacks using ELK/EFK, OpenTelemetry, Jaeger, Prometheus, and Grafana.
Profile and optimize JVM performance, Kafka consumer throughput, file I/O pipelines, and database query patterns.
Establish performance benchmarks, regression gates, and reliability standards.
Debug and resolve system-level issues across OS, JVM, networking, file descriptors, garbage collection, and thread contention.
Design and operate Apache Kafka at scale, including partition strategies, consumer group tuning, offset management, and dead-letter handling.
Apply strong understanding of SFTP, object storage (S3-compatible), and file lifecycle management.
Collaborate with network, OSS/BSS, and downstream consumer teams to align technical solutions.
Drive architectural decisions, conduct design reviews, and mentor engineers across teams.
Apply an SRE mindset, including on-call discipline, capacity planning, and chaos engineering fundamentals.