Python Gen AI

  • Jersey City, NJ
  • Posted 6 hours ago | Updated 6 hours ago

Overview

On Site
$120,000 - $140,000
Full Time

Skills

API
Amazon Web Services
Art
Flask
Generative Artificial Intelligence (AI)
Good Clinical Practice
Artificial Intelligence
Cloud Computing
Collaboration
Language Models
Django
Docker
Evaluation
Google Cloud Platform
Management
Kubernetes
LangChain
LlamaIndex
Machine Learning Operations (ML Ops)
Microsoft Azure
Natural Language Processing
Open Source
Orchestration
Prompt Engineering
Python
Research
Semantic Search
Testing
Use Cases
Vector Databases

Job Details

We are looking for an experienced Python Engineer with expertise in Generative AI (GenAI) to design, develop, and deploy AI-driven solutions. You ll work with state-of-the-art language models (LLMs), fine-tune open-source models, build scalable AI pipelines, and integrate GenAI capabilities into real-world applications.


Key Responsibilities:

  • Design and develop GenAI-based applications using Python and LLMs (e.g., GPT, LLaMA, Mistral, Claude).

  • Fine-tune and deploy open-source and proprietary models using frameworks like Hugging Face Transformers, LangChain, or LlamaIndex.

  • Build REST APIs and integrate GenAI capabilities into products or internal tools.

  • Work with vector databases (e.g., Pinecone, FAISS, Weaviate, Chroma) for RAG (Retrieval-Augmented Generation).

  • Develop and manage prompt engineering strategies for various business use cases.

  • Implement model evaluation, testing, and observability in GenAI applications.

  • Collaborate with data scientists, MLOps engineers, and product teams to productionize GenAI solutions.

  • Stay current with the latest research, trends, and tools in the GenAI/LLM space.


Required Skills:

  • Strong experience in Python programming.

  • Deep understanding of LLMs and Generative AI concepts.

  • Hands-on experience with Hugging Face Transformers, OpenAI API, or similar frameworks.

  • Familiarity with LangChain, LlamaIndex, or other GenAI orchestration frameworks.

  • Experience with prompt engineering and RAG pipelines.

  • Knowledge of FastAPI, Flask, or Django for API development.

  • Familiarity with Docker, Kubernetes, and cloud platforms (AWS/Google Cloud Platform/Azure) for model deployment.

  • Experience working with vector stores and semantic search.

  • Good understanding of NLP evaluation metrics and techniques.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About Techim INC