AI Optimization Engineer
Hybrid in NYC, NY, US • Posted 3 days ago • Updated 3 days ago

Cloud Destinations LLC
Dice Job Match Score™
⭐ Evaluating experience...
Job Details
Skills
- Python
- libraries like NumPy and scikit-learn
- TensorFlow
- PyTorch
- Keras
- SLURM workload manager
- GPU-accelerated clusters
Summary
Qualifications
- Proficiency in languages such as Python, with experience in libraries like NumPy and scikit-learn.
- Knowledge of various machine learning algorithms, including supervised and unsupervised learning, neural networks, decision trees, clustering, and dimensionality reduction.
- Experience with deep learning frameworks such as TensorFlow, PyTorch, or Keras, and knowledge of their architectures and APIs.
- Proficient with SLURM workload manager with REST and Flask APIs for automated and secure job scheduling.
- Experienced in scalable infrastructure for deploying and managing large language models (LLMs),
- HPC engineer with hands-on experience designing and managing GPU-accelerated clusters for large-scale AI/ML workloads.
- Experience with deploying machine learning models in production environments, including containerization, microservices, and API design.
- Leveraging Prometheus and Grafana to collect and analyze metrics, identify performance issues, and implement fixes. Experience creating Slurm and Triton metrics will be a plus.
- Familiarity with Triton Inference Server, including its architecture, configuration, and deployment.
- Knowledge of model optimization techniques, including pruning, quantization, and knowledge distillation.
- Exploratory Data Analysis - Plotly, Seaborn, matplotlib
- Deep Learning, Neural Networks, Decision Trees, Ensemble Methods, Gradient Boosting, Support Vector Machines, Random Forest, Logistic Regression, Transfer learning, Transformer based models, BART, Hyperparameter Tuning, Gen-AI, CNN, Computer Vision, NLP
- Tools and Platforms like - Docker, Kubernetes, Jupyter, MLFlow, Github, Terraform, Jenkins, HuggingFace
- Flask API Development and Security
- Container Runtimes: Enroot, Pyxis, Podman
- Linux (RHEL/CentOS) System Administration
- Model Optimization techniques using Triton with TRTLLM Desired Qualifications:
- Experience with data cleaning, feature scaling, and normalization
- Programming skills creating UI/UX using the Angular framework, HTML, CSS, and JavaScript
- Creating vector embeddings
- Tools and Platforms like - AWS (SageMaker, Lambda, EC2)
- Database Technologies Oracle, MS-SQL, MongoDB, Redis and MySQL
- SQL and PL/SQL Scripting
- Dice Id: 91097117
- Position Id: 8887264
- Posted 3 days ago
Company Info
One of the leading US-based staffing and IT consulting partner. Experience exceptional service and top-tier talent across industries. Count on us for staffing solutions that cater to the unique demands of the American market.
Our experienced recruiters ensure a seamless fit within your team, accelerating success. But we go beyond staffing and empower employees with fully sponsored certification programs, keeping them ahead. Experience comprehensive benefits including health, wellness coverage, dental insurance, vision insurance, as well as flexible hours, remote work options, and a robust 401K plan to ensure a secure future at the companies we represent.
At Cloud Destinations, we bring industry expertise and a passion for excellence. From Enterprise Cloud Strategy to Managed Infrastructure Services, Digital Transformation, BI & Data Analytics, Security, Data Engineering, and more, we navigate the IT landscape with finesse. Choose us as your trusted partner, witness transformative talent and exceptional service. Let's unlock new possibilities and drive your success in the dynamic world of IT together.
Similar Jobs
It looks like there aren't any Similar Jobs for this job yet.
Search all similar jobs
