Title: AI Optimization Engineer
Duration: 6 Months
Location: NYC, NY (Onsite)
Long Term Contract
Qualifications
Proficiency in languages such as Python, with experience in libraries like NumPy and scikit-learn.
Knowledge of various machine learning algorithms, including supervised and unsupervised learning, neural networks, decision trees, clustering, and dimensionality reduction.
Experience with deep learning frameworks such as TensorFlow, PyTorch, or Keras, and knowledge of their architectures and APIs.
Proficient with SLURM workload manager with REST and Flask APIs for automated and secure job scheduling.
Experienced in scalable infrastructure for deploying and managing large language models (LLMs),
HPC engineer with hands-on experience designing and managing GPU-accelerated clusters for large-scale AI/ML workloads.
Experience with deploying machine learning models in production environments, including containerization, microservices, and API design.
Leveraging Prometheus and Grafana to collect and analyze metrics, identify performance issues, and implement fixes. Experience creating Slurm and Triton metrics will be a plus.
Familiarity with Triton Inference Server, including its architecture, configuration, and deployment.
Knowledge of model optimization techniques, including pruning, quantization, and knowledge distillation.
Exploratory Data Analysis - Plotly, Seaborn, matplotlib
Deep Learning, Neural Networks, Decision Trees, Ensemble Methods, Gradient Boosting, Support Vector Machines, Random Forest, Logistic Regression, Transfer learning, Transformer based models, BART, Hyperparameter Tuning, Gen-AI, CNN, Computer Vision, NLP
Tools and Platforms like - Docker, Kubernetes, Jupyter, MLFlow, Github, Terraform, Jenkins, HuggingFace
Flask API Development and Security
Container Runtimes: Enroot, Pyxis, Podman
Linux (RHEL/CentOS) System Administration
Model Optimization techniques using Triton with TRTLLM
Desired Qualifications:
Experience with data cleaning, feature scaling, and normalization
Programming skills creating UI/UX using the Angular framework, HTML, CSS, and JavaScript
Creating vector embeddings
Tools and Platforms like - AWS (SageMaker, Lambda, EC2)
Database Technologies Oracle, MS-SQL, MongoDB, Redis and MySQL
SQL and PL/SQL Scripting