Sr Full Stack Engineer – Generative AI & Python

Irving, TX, US • Posted 1 day ago • Updated 1 day ago
Contract Corp To Corp
Contract W2
12 Months
No Travel Required
On-site
Depends on Experience
Fitment

Dice Job Match Score™

🧠 Analyzing your skills...

Job Details

Skills

  • API
  • Artificial Intelligence
  • Generative Artificial Intelligence (AI)
  • RESTful
  • Python
  • Telecommunications
  • Java
  • Server Side Development

Summary

Job title: Full Stack Developer

Location: Irving, TX

Duration: Long-term

 

ROLE SUMMARY:

We are seeking a Full Stack Developer – AI & Cloud to design, build, and deploy scalable enterprise applications at the intersection of Java/Python server-side development, AWS cloud services, and AI/LLM edge deployments.

 

KEY RESPONSIBILITIES:

  • Design and develop robust server-side applications and RESTful microservices using Java (Spring Boot) and Python, ensuring scalability, security, and high availability across distributed systems.
  • Architect and deploy cloud-native solutions on AWS leveraging services including Lambda, ECS, API Gateway, SageMaker, S3, and EventBridge.
  • Fine-tune open-weight LLM models (e.g., LLaMA, Mistral, Phi) using frameworks such as Hugging Face PEFT and LoRA for domain-specific enterprise use cases.
  • Deploy and manage AI/LLM inference runtimes on edge devices including laptops, on-premise servers, and network routers using tools such as Ollama, llama.cpp, or TensorRT-LLM.
  • Build and maintain CI/CD pipelines for containerized microservices and edge AI model deployments using Docker, Kubernetes, and AWS DevOps tooling.
  • Conduct code reviews, contribute to architectural decisions, and mentor junior engineers on AI-integrated full stack development practices.

 

REQUIRED QUALIFICATIONS:

  • 10+ years of full stack development experience with strong server-side proficiency in Java (Spring Boot) and Python.
  • Telecom Industry experience is a must.
  • Hands-on experience building and deploying microservices on AWS, including services such as Lambda, ECS, API Gateway, and SageMaker.
  • Demonstrated experience fine-tuning LLM models using Hugging Face Transformers, PEFT, or LoRA.
  • Proven ability to deploy and optimize LLM inference on edge devices (CPU/edge GPU) using runtimes such as Ollama, llama.cpp, or ExecuTorch.
  • Proficiency with containerization and orchestration tools including Docker and Kubernetes.
  • Strong understanding of RESTful API design, event-driven architectures, and distributed microservices patterns.

 

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: infotx
  • Position Id: 8986734
  • Posted 1 day ago
Contact the job poster
HN

Harsha Nagaraj

Recruiter @ InfoVision, Inc.
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Irving, Texas

Today

Easy Apply

Contract, Third Party

Depends on Experience

Dallas, Texas

2d ago

Easy Apply

Third Party, Contract

Depends on Experience

Hybrid in Dallas, Texas

14d ago

Easy Apply

Third Party, Contract

Depends on Experience

Irving, Texas

Today

Easy Apply

Contract

$75 - $79

Search all similar jobs