AI and LLM Engineer

  • REMOTE WORK, VA
  • Posted 1 day ago | Updated 1 hour ago

Overview

Remote
On Site
USD 120,001.00 - 160,000.00 per year
Full Time

Skills

Science
Security Clearance
Advanced Analytics
Large Language Models (LLMs)
Open Source
Prompt Engineering
Scripting
Benchmarking
Partnership
Collaboration
Onboarding
Generative Artificial Intelligence (AI)
Computer Science
Data Science
Management
Training
Python
LangChain
PyTorch
TensorFlow
Amazon SageMaker
Amazon S3
Step-Functions
Workflow
Continuous Integration
Continuous Delivery
Terraform
Evaluation
Testing
Amazon Web Services
Orchestration
Use Cases
Regulatory Compliance
FedRAMP
NIST 800-53
Machine Learning (ML)
Problem Solving
Conflict Resolution
Debugging
Artificial Intelligence
Information Technology
Systems Engineering
FOCUS

Job Details

Job ID: 2600092

Location: REMOTE WORK, VA, US

Date Posted: 2026-01-06

Category: Engineering and Sciences

Subcategory: Machine Learning Engineer

Schedule: Full-time

Shift: Day Job

Travel: No

Minimum Clearance Required: None

Clearance Level Must Be Able to Obtain: Public Trust

Potential for Remote Work: Yes

Description

We are seeking a hands-on AI & LLM Engineer to join the AWS AI/GenAI Solutions Team within the IRS Advanced Analytics Program (AAP). This role focuses on the practical development and deployment of large language models (LLMs) and GenAI solutions using AWS services such as SageMaker, Bedrock, and open-source frameworks.

The engineer will be directly responsible for coding pipelines, fine-tuning models, building inference endpoints, and integrating GenAI workflows into production. By working closely with data engineers, architects, and Trustworthy AI specialists, this role ensures that GenAI capabilities are secure, scalable, and aligned with IRS mission needs.

Key Responsibilities:
  • Build end-to-end LLM pipelines: data preparation, training, fine-tuning, and evaluation of models using SageMaker and Bedrock.
  • Develop prompt engineering strategies, chaining pipelines, and custom evaluation scripts to validate LLM behavior.
  • Implement RAG (retrieval-augmented generation) workflows by integrating LLMs with IRS data sources.
  • Code and deploy inference endpoints, APIs, and integration layers for mission teams to consume LLM services.
  • Optimize model performance, latency, and cost through benchmarking, hyperparameter tuning, and scaling strategies.
  • Embed bias detection, fairness, and explainability checks in model pipelines, in partnership with Trustworthy AI specialists.
  • Contribute to CI/CD automation for LLM deployments, including rollback and retraining workflows.
  • Write production-grade Python code and leverage frameworks such as Hugging Face Transformers, LangChain, PyTorch, or TensorFlow.
  • Document workflows and create reusable templates/accelerators for faster onboarding of new GenAI use cases.
  • Participate in hands-on troubleshooting and debugging of pipelines, deployments, and model behavior.

Qualifications

Required Qualifications:
  • Bachelor's or master's degree in computer science, Data Science, or related field.
  • 5+ years of hands-on AI/ML engineering experience, including direct model training, fine-tuning, and deployment.
  • Strong expertise in Python programming and ML/LLM frameworks (Hugging Face, LangChain, PyTorch, TensorFlow).
  • Experience with AWS AI services (SageMaker, Bedrock, S3, Lambda, Step Functions) in production workflows.
  • Proven ability to build and deploy inference endpoints and APIs for AI/ML workloads.
  • Familiarity with CI/CD pipelines and IaC (Terraform, CloudFormation) for model deployment.
  • Practical understanding of LLM evaluation methods (prompt testing, bias/toxicity detection, response consistency).


Desired Skills:
  • Certifications: AWS Certified Machine Learning Specialty or equivalent.
  • Experience implementing RAG pipelines or multi-model orchestration for enterprise use cases.
  • Familiarity with federal compliance frameworks (FedRAMP, NIST 800-53) and secure AI/ML operations.
  • Knowledge of Trustworthy AI principles (auditability, explainability, fairness) in LLM contexts.
  • Strong problem-solving skills and ability to debug real-world AI/LLM issues in production.


Target salary range: $120,001 - $160,000. The estimate displayed represents the typical salary range for this position based on experience and other factors.


Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About SAIC