Overview
Skills
Job Details
Role: AI Engineer
Location: New Jersey
Duration: Long Term
Note: W2’s are highly Preferred
JD: In this role, you will be responsible for launching and implementing GenAl agentic solutions aimed at reducing the risk and cost of managing large-scale production environments with varying complexities. You will address various production runtime challenges by developing agentic Al solutions that can diagnose, reason, and take actions in production environments to improve productivity and address issues related to production support.
Essential Skills
1. 5+ years of software development in one or more languages (Python, C/C++, Go, Java); strong hands-on experience building and maintaining large-scale Python applications preferred.
2. 3+ years designing, architecting, testing, and launching production ML systems, including model deployment/serving, evaluation and monitoring, data processing pipelines, and model fine-tuning workflows.
3. Practical experience with Large Language Models (LLMs): API integration, prompt engineering, fine-tuning/adaptation, and building applications using RAG and tool-using agents (vector retrieval function calling, secure tool execution).
4. Understanding of different LLMs, both commercial and open source, and their capabilities (e.g., OpenAI, Gemini, Llama, Qwen, Claude).
5. Solid grasp of applied statistics, core ML concepts, algorithms, and data structures to deliver efficient and reliable solutions.
6. Strong analytical problem-solving, ownership, and urgency; ability to communicate complex ideas simply and collaborate effectively across global teams with a focus on measurable business impact.
7. Preferred: Proficiency building and operating on cloud infrastructure (ideally AWS), including containerized services (ECS/EKS), serverless (Lambda), data services ($3, DynamoDB, Redshift), orchestration (Step Functions), model serving (SageMaker), and infra-as-code (Terraform/CloudFormation).