AI Optimization Engineer || NYC, NY (Onsite)

New York, NY, US • Posted 15 days ago • Updated 15 days ago
Contract W2
On-site
$70 - $90/hr

Job Details

Skills

  • AI Optimization Engineer

Summary

Title: AI Optimization Engineer

Duration: 6 Months

Location: NYC, NY (Onsite)

Long Term Contract

Qualifications

Proficiency in languages such as Python, with experience in libraries like NumPy and scikit-learn.

Knowledge of various machine learning algorithms, including supervised and unsupervised learning, neural networks, decision trees, clustering, and dimensionality reduction.

Experience with deep learning frameworks such as TensorFlow, PyTorch, or Keras, and knowledge of their architectures and APIs.

Proficiency with the Slurm workload manager, using REST and Flask APIs for automated and secure job scheduling.

Experience with scalable infrastructure for deploying and managing large language models (LLMs).

Hands-on HPC engineering experience designing and managing GPU-accelerated clusters for large-scale AI/ML workloads.

Experience with deploying machine learning models in production environments, including containerization, microservices, and API design.

Experience using Prometheus and Grafana to collect and analyze metrics, identify performance issues, and implement fixes. Experience creating Slurm and Triton metrics is a plus.

Familiarity with Triton Inference Server, including its architecture, configuration, and deployment.

Knowledge of model optimization techniques, including pruning, quantization, and knowledge distillation.

Exploratory Data Analysis: Plotly, Seaborn, Matplotlib

Deep Learning, Neural Networks, Decision Trees, Ensemble Methods, Gradient Boosting, Support Vector Machines, Random Forest, Logistic Regression, Transfer Learning, Transformer-based models, BART, Hyperparameter Tuning, Gen-AI, CNN, Computer Vision, NLP

Tools and Platforms: Docker, Kubernetes, Jupyter, MLflow, GitHub, Terraform, Jenkins, Hugging Face

Flask API Development and Security

Container Runtimes: Enroot, Pyxis, Podman

Linux (RHEL/CentOS) System Administration

Model optimization techniques using Triton with TensorRT-LLM (TRT-LLM)

Desired Qualifications:

Experience with data cleaning, feature scaling, and normalization

Programming skills creating UI/UX using the Angular framework, HTML, CSS, and JavaScript

Creating vector embeddings

Tools and Platforms: AWS (SageMaker, Lambda, EC2)

Database Technologies: Oracle, MS-SQL, MongoDB, Redis, and MySQL

SQL and PL/SQL Scripting

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 91100016
  • Position Id: 8887930