Sr. Python Architect

Overview

On Site
Hybrid
Depends on Experience
Full Time

Skills

CUDA
Python
GPU
cloud infrastructure
performance and scalability
DevOps
containerization
Zuora
Elastic Stack

Job Details

Our client, 8bit.AI, is a dynamic startup in the Bay Area, CA, that is hiring full-time employees. The company is focused on developing a high-performance, multi-technology, vendor-independent, xPU-based Accelerated Cloud Computing platform. We stack massive clusters purpose-built for high-performance parallel computing and aim to launch a global accelerated cloud solution. Additionally, the firm will focus on broader Artificial General Intelligence (AGI) products, supercomputing services, and end-to-end AI engineering services.

About the Role:

We are searching for a talented and motivated Python Professional to join our team and play a key role in integrating AI models with our backend GPU infrastructure. You will be responsible for developing and maintaining code to seamlessly connect AI models to our GPU resources, ensuring efficient utilization and optimal performance. Additionally, you will be tasked with creating user-friendly interfaces for monitoring and controlling GPU usage, providing valuable insights for our team.

Responsibilities:

  • Develop and implement Python code to deploy AI models on our backend GPU infrastructure using DevOps and MLOps practices.
  • Design and build data models, drawing on experience with data ingestion and processing.
  • Optimize code for performance and scalability, ensuring efficient utilization of GPU resources.
  • Develop and maintain HPC and GPU cloud compute through a self-service cloud platform.
  • Develop and maintain a DevOps pipeline to provide 100% lights-out operations, including observability.
  • Design and implement APIs for seamless interaction with orchestration layers and GPU infrastructure.
  • Create user-friendly interfaces for monitoring and controlling GPU usage, including real-time performance metrics and control functionalities.
  • Maintain existing codebase and troubleshoot any technical issues related to AI model deployment and GPU infrastructure.
  • Collaborate closely with data scientists, engineers, and other stakeholders to ensure successful project execution.

Qualifications:

  • Master's degree in Electrical Engineering, Computer Engineering, or a related field (or equivalent experience).
  • At least 3 to 5 years of experience working with GPU infrastructure, including knowledge of CUDA or libraries such as cuDNN.
  • Proven experience as a Python developer with a strong understanding of object-oriented programming principles and best practices.
  • Familiarity with backend development concepts and experience working with APIs.
  • Prior experience setting up automated DevOps pipelines, MLOps using MLflow/Kubeflow/Bento, observability with Elastic Stack or equivalent, security orchestration, and containerization, as well as experience with billing platforms such as Zuora, Chargebee, or Zoho.
  • Solid understanding of OpenStack concepts and experience managing cloud infrastructure.
  • Experience with big data technologies like Hadoop, Spark, and Kafka.
  • Familiarity with data integration and ETL tools, such as Talend, Informatica, or Apache NiFi.
  • Excellent communication and collaboration skills, with the ability to work effectively in a team environment.
  • A strong problem-solving mindset and a passion for learning new technologies.
  • Experience at companies such as CoreWeave, Vultr, Lambda, Nvidia, and Broadcom is preferred.

Please send resumes to srini at zaspar dot com
