Job Title:Senior Technical Architect
Location: Basking Ridge NJ 7920 ( Hybrid )
Duration: 6-12+ Months Contract
** Visa-independent candidates required **
Job Summary:
Senior Engineer / Architect – Packet Broker and Probes (Telecom)
About the Role
We are seeking a seasoned Architect to join our Packet Brokers and Probes platform team. This role sits at the intersection of telecom systems engineering, distributed backend development, and platform reliability. You will own and evolve a high-throughput probes management pipeline that ingests, transforms, analyzes using AI/ML, and routes event streams across carrier-grade infrastructure. This is a hands-on technical leadership role — you will design systems, write production code, debug at the byte level, and set engineering standards across the team.
Key Responsibilities
· System Architecture & Design – Lead the design of scalable, fault-tolerant components including file ingestion, format conversion, gNMP extraction, SFTP distribution, event monitoring, and downstream delivery pipelines.
· Binary Protocol & Telecom Data Expertise – Parse, decode, and transform telecom data formats including SNMP MIBs,and proprietary carrier formats at the byte-stream level.
· Java / Spring Boot Development – Build and maintain core probe mediation services using Java 17+ and Spring Boot (Spring Integration, Spring Batch, Spring Kafka); contribute to shared starters and reusable platform libraries.
· Kubernetes / OpenShift Operations – Design and deploy containerized workloads on OpenShift/Kubernetes; own Helm chart definitions, resource tuning, PVC strategies, horizontal scaling, and rolling upgrade policies.
· Oracle Database – Design and optimize schemas for high-volume probes metadata, reconciliation state, and audit trails; write and tune complex SQL for operational reporting.
· Site Reliability Engineering (SRE) – Define and enforce SLOs/SLIs; lead incident response; drive post-mortems; implement error budgets and auto-remediation patterns into the platform.
· Advanced Observability & Monitoring – Build deep observability into the platform: structured logging (Logback/ELK), distributed tracing (OpenTelemetry/Jaeger), metrics (Micrometer/PrometheGrafana), and alerting strategies for carrier-grade availability.
· Performance & Reliability Engineering – Profile and optimize JVM performance, Kafka consumer throughput, file I/O pipelines, and database query patterns; establish benchmarks and regression gates.
· Technical Leadership – Drive architectural decisions, conduct design reviews, mentor engineers
Required Qualifications
· 17+ years of software engineering experience, with 5+ years in telecom or carrier-grade OSS/BSS/OEM platforms
· Deep expertise in Java (17+) and Spring Boot ecosystem (Spring Integration, Spring Kafka, Spring Batch)
· Hands-on experience with SNMP, gRPC and byte-stream manipulation
· Real-time high volume data pipelines with online analytics (stream)
· Strong Kubernetes and OpenShift skills: Helm, resource policies, stateful workloads, network policies, operator patterns
· Proficient in Oracle SQL and PL/SQL: schema design, indexing strategies, partitioning, explain plans
· SRE mindset: SLO/SLI definition, on-call discipline, chaos engineering fundamentals, capacity planning
· Experience building production-grade observability: Prometheus, Grafana, OpenTelemetry, ELK/EFK, alerting pipelines
· Demonstrated ability to debug and resolve issues at the system level — OS, JVM, network, file descriptor limits, pressure, thread contention
· Experience with Apache Kafka at scale: partition strategy, consumer group tuning, offset management, dead-letter handling
· Solid understanding of SFTP, S3/object storage, file lifecycle management, and large-scale file distribution patterns
Preferred Qualifications
· Experience with systems-level tooling or microservices (a plus given our Go-based streaming components)
· Familiarity with 4G/LTE network data (VoLTE, CSCF,VZW/carrier-specific formats)
· Knowledge of OpenShift GitOps (ArgoCD), CI/CD pipelines (Jenkins/Tekton)
· Contributions to or design of shared platform starters (Spring Boot autoconfiguration, Helm library charts)
· Experience with replica coordination, distributed locking, and exactly-once processing guarantees
· Experience in AI-ML, LLM/SLM
What We''re NOT Looking For
This is not a role for someone who works exclusively at the API layer. The right candidate is comfortable reading a hex dump, correlating a Kafka consumer lag spike to a pause, and tuning a JVM to sustain 50K records/sec through a binary transformation pipeline.
Tech Stack
Language: Java 17+, Python
Frameworks: Spring Boot, Spring Integration, Spring Kafka, Spring Batch
Messaging: Apache Kafka
Storage: Oracle DB, S3 / Object Storage
Container Platform: Kubernetes, OpenShift, Helm
Observability: Prometheus, Grafana, OpenTelemetry, ELK, Logback
File Transport: SFTP, S3, NFS/PVC
Telecom Formats: ASN.1, AMA, 3GPP CDR/UDR, binary byte streams
CI/CD: Jenkins, GitOps