Role: AI/ML Engineer
Company: Arch Systems
Client: US Federal Govt.
Location: Remote
Type: Full-time
Mandotory: Candidate must have experience working in an US Federal Govt project
Job Summary:
Arch Systems is seeking a highly experienced, hands-on Principal AI/ML Leader with 8+ years of experience designing, building, and deploying scalable AI/ML solutions in production environments. This role is ideal for a senior technical leader who can provide architectural direction, lead AI/ML solution delivery, and remain deeply hands-on in development, integration, deployment, and troubleshooting.
The ideal candidate will have strong experience developing machine learning models, building data pipelines, deploying production-grade AI systems, and working in modern cloud and containerized environments. This individual must also have experience supporting federal customers, federal programs, or federal contracts, with an understanding of secure, compliance-driven delivery environments.
We are looking for someone who can lead from the front, mentor technical teams, collaborate with stakeholders, and directly contribute to the design, development, and deployment of AI/ML solutions.
- Key Responsibilities:
Lead the design, development, and deployment of machine learning models using frameworks such as TensorFlow, PyTorch, and scikit-learn - Serve as a hands-on principal technical leader, contributing directly to architecture, coding, model development, deployment, optimization, and troubleshooting
- Build and maintain scalable data pipelines using Apache Kafka, REST APIs, and SFTP for real-time and batch data ingestion
Perform data preprocessing, profiling, feature engineering, and synthetic data generation using tools such as SDV, Faker, and pandas-profiling - Implement robust data quality and validation frameworks using Great Expectations and Deequ
Develop and deploy RESTful APIs for ML model serving using FastAPI and Spring Boot
Containerize applications using Docker and orchestrate deployments with Kubernetes and Oracle Kubernetes Engine (OKE) - Manage cloud infrastructure and AI/ML deployments on Oracle Cloud Infrastructure (OCI)
- Build and maintain CI/CD pipelines using GitHub Actions and Azure DevOps for automated testing and deployment
Develop interactive dashboards and visualizations using Dash, Plotly, and React - Ensure security and compliance with standards such as NIST SP 800-53, FIPS 140-2, DISA STIGs, and RMF
- Collaborate with cross-functional teams including data engineers, software developers, architects, federal stakeholders, and business leaders
- Mentor engineers and establish best practices across AI/ML engineering, MLOps, model lifecycle management, and production deployment
- Provide technical leadership in AI/ML strategy, solution architecture, implementation planning, and delivery within federal and regulated environments