AI and LLM Engineer

Overview

Remote

On Site

USD 120,001.00 - 160,000.00 per year

Full Time

Skills

Science

Security Clearance

Advanced Analytics

Large Language Models (LLMs)

Open Source

Prompt Engineering

Scripting

Benchmarking

Partnership

Collaboration

Onboarding

Generative Artificial Intelligence (AI)

Computer Science

Data Science

Management

Training

Python

LangChain

PyTorch

TensorFlow

Amazon SageMaker

Amazon S3

Step-Functions

Workflow

Continuous Integration

Continuous Delivery

Terraform

Evaluation

Testing

Amazon Web Services

Orchestration

Use Cases

Regulatory Compliance

FedRAMP

NIST 800-53

Machine Learning (ML)

Problem Solving

Conflict Resolution

Debugging

Artificial Intelligence

Information Technology

Systems Engineering

FOCUS

Job Details

Job ID: 2600092

Location: REMOTE WORK, VA, US

Date Posted: 2026-01-06

Category: Engineering and Sciences

Subcategory: Machine Learning Engineer

Schedule: Full-time

Shift: Day Job

Travel: No

Minimum Clearance Required: None

Clearance Level Must Be Able to Obtain: Public Trust

Potential for Remote Work: Yes

Description

We are seeking a hands-on AI & LLM Engineer to join the AWS AI/GenAI Solutions Team within the IRS Advanced Analytics Program (AAP). This role focuses on the practical development and deployment of large language models (LLMs) and GenAI solutions using AWS services such as SageMaker, Bedrock, and open-source frameworks.

The engineer will be directly responsible for coding pipelines, fine-tuning models, building inference endpoints, and integrating GenAI workflows into production. By working closely with data engineers, architects, and Trustworthy AI specialists, this role ensures that GenAI capabilities are secure, scalable, and aligned with IRS mission needs.

Key Responsibilities:

Build end-to-end LLM pipelines: data preparation, training, fine-tuning, and evaluation of models using SageMaker and Bedrock.
Develop prompt engineering strategies, chaining pipelines, and custom evaluation scripts to validate LLM behavior.
Implement RAG (retrieval-augmented generation) workflows by integrating LLMs with IRS data sources.
Code and deploy inference endpoints, APIs, and integration layers for mission teams to consume LLM services.
Optimize model performance, latency, and cost through benchmarking, hyperparameter tuning, and scaling strategies.
Embed bias detection, fairness, and explainability checks in model pipelines, in partnership with Trustworthy AI specialists.
Contribute to CI/CD automation for LLM deployments, including rollback and retraining workflows.
Write production-grade Python code and leverage frameworks such as Hugging Face Transformers, LangChain, PyTorch, or TensorFlow.
Document workflows and create reusable templates/accelerators for faster onboarding of new GenAI use cases.
Participate in hands-on troubleshooting and debugging of pipelines, deployments, and model behavior.

Qualifications

Required Qualifications:

Bachelor's or master's degree in computer science, Data Science, or related field.
5+ years of hands-on AI/ML engineering experience, including direct model training, fine-tuning, and deployment.
Strong expertise in Python programming and ML/LLM frameworks (Hugging Face, LangChain, PyTorch, TensorFlow).
Experience with AWS AI services (SageMaker, Bedrock, S3, Lambda, Step Functions) in production workflows.
Proven ability to build and deploy inference endpoints and APIs for AI/ML workloads.
Familiarity with CI/CD pipelines and IaC (Terraform, CloudFormation) for model deployment.
Practical understanding of LLM evaluation methods (prompt testing, bias/toxicity detection, response consistency).

Desired Skills:

Certifications: AWS Certified Machine Learning Specialty or equivalent.
Experience implementing RAG pipelines or multi-model orchestration for enterprise use cases.
Familiarity with federal compliance frameworks (FedRAMP, NIST 800-53) and secure AI/ML operations.
Knowledge of Trustworthy AI principles (auditability, explainability, fairness) in LLM contexts.
Strong problem-solving skills and ability to debug real-world AI/LLM issues in production.

Target salary range: $120,001 - $160,000. The estimate displayed represents the typical salary range for this position based on experience and other factors.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Job Details

About SAIC

Share