Senior AI Engineer

Overview

On Site
$100 - $120 hourly
Contract - W2
Contract - Temp

Skills

GRID
Software Design
Data Loading
Research
Algorithms
Testing
Computer Hardware
Cloud Computing
Continuous Improvement
Documentation
Workflow
Computer Science
Machine Learning (ML)
JAX
PyTorch
GPU
Optimization
Debugging
Performance Tuning
Communication
Teamwork
Collaboration
Evaluation
GitHub
Open Source
Deep Learning
Writing
CUDA
Performance Monitoring
Training
Data Engineering
Machine Learning Operations (ML Ops)
Problem Solving
Conflict Resolution
Rapid Prototyping
Artificial Intelligence
Messaging

Job Details

RESPONSIBILITIES:
Kforce has a client in Orlando, FL that is seeking a Senior AI Engineer.

Summary:
As a highly skilled and driven Senior AI Engineer, you will be a founding team member, developing the critical data and AI infrastructure for training foundation models for power grid applications. You will be instrumental in building and optimizing the end-to-end systems, data pipelines, and training processes that will power our AI research. Working closely with research scientists, you will translate cutting-edge research into robust, scalable, and efficient implementations, enabling the rapid development and deployment of transformational AI solutions.

Responsibilities:
* Designing, building, and optimizing all aspects of large-scale training and fine-tuning, from dataloading to inference, to maximize Model Flop Utilization (MFU) on large compute clusters
* Working closely and proactively with research scientists to translate models and algorithms into high-performance, production-ready code, integrating and testing the latest advancements
* Relentlessly profiling and resolving training performance bottlenecks, optimizing the entire training stack for speed and efficiency
* Contributing to the technology evaluations and selection of hardware, software, and cloud services for the AI infrastructure platform
* Using MLOps frameworks (MLFlow, WnB, etc.) to ensure best practices across the model lifecycle, ensuring reproducibility, reliability, and continuous improvement
* Creating thorough documentation for infrastructure and training procedures, staying updated on advancements in training strategies, and driving improvements in workflows and infrastructure

REQUIREMENTS:
* Master's degree or higher in Computer Science, Engineering, or a related technical field; Candidates with more experience can be considered for a higher level or vice-versa
* 5 or more years in a Data & AI (Artificial Intelligence) Engineer or Machine Learning Engineer, focusing on building and optimizing infrastructure for large-scale machine learning systems
* Deep practical expertise with AI frameworks (PyTorch, Jax, Pytorch Lightning, etc.), large-scale multi-node GPU training, and optimization strategies for large foundation models on distributed compute infrastructure
* Excellent problem-solving, debugging, and performance optimization skills, with a data-driven approach to identifying and resolving technical challenges
* Strong communication and teamwork skills, experience with MLOps best practices for model tracking, evaluation, and deployment

Preferred qualifications include:
* Public GitHub profile with a track record of open-source contributions to data engineering or deep learning infrastructure projects
* Experience writing CUDA/Triton/CUTLASS kernels
* Proficiency with performance monitoring and profiling tools for distributed training and data pipelines

This role requires deep hands-on expertise in distributed training, data engineering, MLOps, a proven track record of building scalable AI infrastructure. Our successful candidate will be a high-agency individual demonstrating initiative, problem-solving, and a commitment to delivering robust and scalable solutions for rapid prototyping and turnaround.

The pay range is the lowest to highest compensation we reasonably in good faith believe we would pay at posting for this role. We may ultimately pay more or less than this range. Employee pay is based on factors like relevant education, qualifications, certifications, experience, skills, seniority, location, performance, union contract and business needs. This range may be modified in the future.

We offer comprehensive benefits including medical/dental/vision insurance, HSA, FSA, 401(k), and life, disability & ADD insurance to eligible employees. Salaried personnel receive paid time off. Hourly employees are not eligible for paid time off unless required by law. Hourly employees on a Service Contract Act project are eligible for paid sick leave.

Note: Pay is not considered compensation until it is earned, vested and determinable. The amount and availability of any compensation remains in Kforce's sole discretion unless and until paid and may be modified in its discretion consistent with the law.

This job is not eligible for bonuses, incentives or commissions.

Kforce is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, national origin, age, protected veteran status, or disability status.

By clicking ?Apply Today? you agree to receive calls, AI-generated calls, text messages or emails from Kforce and its affiliates, and service providers. Note that if you choose to communicate with Kforce via text messaging the frequency may vary, and message and data rates may apply. Carriers are not liable for delayed or undelivered messages. You will always have the right to cease communicating via text by using key words such as STOP.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About Kforce Technology Staffing