Principal AI Platform Engineer

Remote • Posted 2 hours ago • Updated 2 hours ago
Full Time
Remote
USD $190,000.00 - 225,000.00 per year
Fitment

Dice Job Match Score™

✨ Finding the perfect fit...

Job Details

Skills

  • Innovation
  • Operational Excellence
  • Open Systems
  • Internal Communications
  • Integrated Circuit
  • IC
  • CPU
  • GPU
  • Computer Hardware
  • Visualization
  • Video
  • Professional Services
  • IaaS
  • Version Control
  • Hardening
  • Security Controls
  • Change Management
  • Backbone.js
  • Orchestration
  • Meta-data Management
  • Computer Networking
  • Storage
  • Database
  • Promotions
  • Regression Analysis
  • IT Security
  • Regulatory Compliance
  • Mentorship
  • Artificial Intelligence
  • Python
  • API
  • PostgreSQL
  • LangChain
  • Continuous Integration
  • Continuous Delivery
  • Terraform
  • Microsoft Azure
  • DevOps
  • RBAC
  • Management
  • Auditing
  • Testing
  • Documentation
  • Aerospace
  • DO-178C
  • GitHub
  • JIRA
  • C
  • C++
  • Workflow
  • Change Control
  • Evaluation
  • UI
  • React.js
  • Professional Development

Summary

Job Title: Principal AI Platform Engineer

Location: Remote - US

Compensation:$190,000 - $225,000 + Bonus Eligible

Who we are: Lynx delivers modular, open standards-based software that transforms how high-assurance, mission-critical edge systems are built, deployed, and maintained. Our secure edge computing solutions enable innovation and operational excellence in the world's most demanding environments, from aerospace and defense to commercial and industrial systems. We partner across industries including automotive, medical, and critical infrastructure to deliver tailored solutions aligned with each customer's mission and operational requirements. Our key products and services are:

  • MOSA.ic: LYNX MOSA.ic is a modular software framework and architecture purpose-built for mission-critical edge computing. Based on the Modular Open Systems Approach (MOSA), it provides a flexible foundation for building secure, scalable, and certifiable edge systems.
  • LYNX MOSA.ic.AI: LYNX MOSA.ic.AI is a unified CPU and GPU software platform that enables deterministic, certifiable deployment of AI and advanced workloads in mission-critical edge systems. It brings control, performance, and lifecycle governance together, allowing AI to operate predictably within safety-critical environments without compromising certification or system integrity.
  • CoreSuite 2.0: CoreSuite 2.0 is Lynx's safety-critical GPU for graphics enablement framework designed for mission-critical edge computing systems. It provides hardware-accelerated graphics, visualization, and video processing capabilities that can be certified for high-assurance systems.
  • Services: Lynx Services is Lynx's professional services organization that helps customers design, integrate, certify, deploy, and maintain safety- and security-critical systems. It supports industries like aerospace, defense, automotive, and industrial computing through consulting, engineering, integration, and lifecycle support, reducing development risk and accelerating certification in standards-driven, mission-critical environment.

Role Overview

This should be a builder-architect: someone who can take multiple partially mature AI tools and make them operate like one disciplined platform. The right person should be equally comfortable with engineering architecture, backend integration, cloud infrastructure, LLM tooling, and production hardening.
AI workflow orchestration with LangChain / LangGraph or equivalent frameworks
LLM observability, prompt/version management, and evaluation systems such as Langfuse
Azure platform engineering using Container Apps, PostgreSQL, Key Vault, Entra ID, private networking, and monitoring
Secure backend and API integrations with systems such as CodeBeamer, GitHub, and webhook-driven workflows
Production hardening through infrastructure as code, CI/CD, testing, rollback, rate limiting, security controls, and auditability
Regulated-workflow thinking, where traceability, human-in-the-loop review, and controlled change management matter as much as model quality

Mission for the role

Own the AI platform as the engineering backbone for AI-assisted certification and engineering workflows. This person should make the platform secure, stable, measurable, and extensible so that new AI tools can be built and operated with confidence.

Key responsibilities
Define and enforce the platform standard for how AI tools use orchestration frameworks, prompt assets, tracing, and metadata
Bring existing advanced tools into alignment with shared platform conventions while preserving important agentic or workflow-specific behavior
Build and maintain Azure-based production infrastructure, including networking, identity, secrets, storage, database, monitoring, and deployment patterns
Implement infrastructure as code and CI/CD for sandbox-to-production promotion
Deepen LLMOps capabilities, including prompt versioning, golden datasets, automated evaluations, cost tracking, feedback loops, regression detection, and release controls
Own secure integrations with CodeBeamer, GitHub, and event-driven APIs or webhooks
Establish operational discipline through logging, alerting, rollback, test coverage, runbooks, rate limiting, and supportability
Partner with engineering, IT, security, and compliance stakeholders to support auditable AI-assisted workflows
Own and evolve the Platform AI to provide standard and secure approach to access AI assisted capabilities across the organization for certification workflows
Mentor and coach other senior/intermediate engineers on team, provide technical guidance, and conduct architectural review for trade offs
Help define technical trajectory of the platform and AI tools

Qualifications
10+ years of relevant experience
Bachelor's Degree in engineering related discipline preferred
Strong Python backend engineering and API integration experience
Strong Azure platform experience, especially Container Apps, VNet/private endpoints, Entra ID, Managed Identity, Key Vault, PostgreSQL, ACR, and monitoring

Hands-on experience with LLM application frameworks such as LangChain, LangGraph, or close equivalents
Hands-on experience with LLM observability or evaluation tooling such as Langfuse or equivalent tracing and eval systems
Experience building CI/CD and infrastructure as code with Terraform, Bicep, GitHub Actions, Azure DevOps, or comparable tools
Experience securing internal platforms with RBAC, secrets management, service-to-service auth, webhook validation, rate limiting, and audit logging
Ability to design reliable multi-step or agentic workflows, including retries, state handling, guardrails, and output validation
Strong operational judgment around testing, rollback, monitoring, alerting, documentation, and runbooks

Strongly preferred
Experience in regulated, safety-critical, aerospace, defense, medical, or similarly controlled environments
Familiarity with DO-178C-style traceability, auditability, formal review workflows, or human-in-the-loop approval requirements
Experience integrating with CodeBeamer, GitHub Enterprise, Jira, or similar enterprise engineering systems
Familiarity with C/C++ code analysis or test-generation workflows
Experience with prompt governance, change control, and evaluation datasets
Some comfort with internal-tool UI work such as React, though this should remain secondary to platform, backend, and infrastructure strength

Sound Exciting? Get in touch today! We have very robust benefits including:

  • Low-cost Medical / Dental / Vision coverage options
  • 401K with generous employer match
  • Responsible Paid Time Off + Paid Holidays
  • Remote work opportunities based on role
  • Employee Assistance Program (EAP)
  • Career growth and professional development opportunities

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 91155221
  • Position Id: 8a656d0cf17f055619e4e29202b3035e
  • Posted 2 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Remote

Today

Full-time

Remote

Yesterday

Easy Apply

Full-time

Depends on Experience

Remote

Today

Full-time

USD 210,000.00 - 250,000.00 per year

Remote or San Francisco, California

Today

Full-time

USD 120,100.00 - 214,500.00 per year

Search all similar jobs