Ajax Jobs in San Francisco, CA

Refine Results
41 - 60 of 77 Jobs

Software Engineer, ML Inference Compiler & Deployment, GPU, CPU

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within our Autonomy teams, you will contribute to one of the most advanced and widely deployed AI Platforms in the world for Autopilot and our Humanoid Robot, Optimus. In this role, you will be responsible for the internal working of the AI inference stack and compiler running neural networks in millions of Tesla vehicles and Optimus. You will collaborate closely with the AI Engineers and Hardware Engineers to understand the full inference stack and design the compiler to

Software Engineer, ML Inference Compiler & Deployment, AI Accelerator

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within our Autonomy teams, you will contribute to one of the most advanced and widely deployed AI Platforms in the world for Autopilot and our Humanoid Robot, Optimus. In this role, you will be responsible for the internal working of the AI inference stack and compiler running neural networks in millions of Tesla vehicles and Optimus. You will collaborate closely with the AI Engineers and Hardware Engineers to understand the full inference stack and design the compiler to

Senior Director of Data Engineering

Visa Inc.

Foster City, California, USA

Full-time

Company Description Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network, enabling individuals, businesses, and economies to thrive while driven by a common purpose - to uplift everyone, everywhe

Machine Learning Kernel Performance Engineer, Dojo

Tesla Motors

Palo Alto, California, USA

Full-time

As a member of the Dojo Machine Learning software team, you will be responsible for developing and optimizing machine learning kernels to run on a massively parallel machine. The ideal candidate will have a strong background in kernel development and performance optimization for AI workloads, with a passion for delivering high-performance implementations, which are simple to use. Responsibilities Develop and validate datapath kernels on a massively parallel machineDesign and deliver implementat

Ray Inference Engineer

Veear

Remote

Contract

Position Summary: Designing, implementing, and maintaining distributed systems to build world-class ML platforms/products at scaleExperiment with, deploy, and manage LLMs in a production contextBenchmark and optimize inference deployments for different workloads, e.g. online vs. batch vs. streaming workloadsDiagnose, fix, improve, and automate complex issues across the entire stack to ensure maximum uptime and performanceDesign and extend services to improve functionality and reliability of the

Senior DGX Cloud AI Infrastructure Software Engineer

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to the infrastructure that powers our innovative AI research. This team focuses on optimizing efficiency and resiliency of AI workloads, as well as developing scalable AI and Data infrastructure tools and services. Our objective is to deliver a stable, scalable environment for AI researchers, providing them with the necessary resources and scale to foster innovation. We are seeking an AI infrastructure software engineer to join our

Principal Software Engineer, TensorRT-LLM

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

We are now looking for a Principal Software Engineer, TensorRT-LLM ! NVIDIA is hiring experienced principal software engineer for its TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in AI, enabling breakthroughs in areas like content creation, code generation and information synthesis, that have put AI on the precipice of an "iPhone moment". Join the team building the AI serving software which is foundational to product lines within NVIDIA

Infrastructure Cloud Security Engineer

NasTech Global, Inc.

US

Contract

Job Title: Infrastructure Cloud Security Engineer Location: Remote -US Job Type: Contract W2 Skills: GPU / TPU Experience Familiarity with hands-on IaC is a must Terraform GKE Networking Storage Python Library familiarity with: numpy/pandas/Pytorch/JAX, including optimization Experience with Nvidia and/or Google TPU hardware in GCE and GKE ML-specific Google Cloud Platform products: Parallel store, Hyperdisk ML, TCP Direct Troubleshooting and optimization of large (1000s of nodes) GKE clusters"

Senior GenAI Algorithms Engineer

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

We are now looking for a Senior Gen AI Algorithms Engineer! NVIDIA is seeking engineers to design, develop and optimize Artificial Intelligence solutions to diverse real-world problems. If you have a strong understanding of deep learning and in particular large language models and their multimodal variants, then this role may be a great fit for you! Collaborate and interact with internal partners, users, and members of the open source community to analyze, define and implement highly optimized A

GPU SME- 10+ yrs- Remote (US Only)

iMedhas Consulting Services

Remote

Third Party, Contract

Job description: GPU / TPU Experience Familiarity with hands on IaC is a must Terraform GKE Networking Storage Python Library familiarity with: numpy/pandas/Pytorch/JAX, including optimization Experience with Nvidia and/or Google TPU hardware in GCE and GKE ML specific Google Cloud Platform products: Parallel store, Hyperdisk ML, TCP Direct Troubleshooting and optimization of large (1000s of nodes) GKE clusters

GPU SME- 10+ yrs- Remote (US Only)

iMedhas Consulting Services

Remote

Third Party, Contract

Job description: GPU / TPU Experience Familiarity with hands on IaC is a must Terraform GKE Networking Storage Python Library familiarity with: numpy/pandas/Pytorch/JAX, including optimization Experience with Nvidia and/or Google TPU hardware in GCE and GKE ML specific Google Cloud Platform products: Parallel store, Hyperdisk ML, TCP Direct Troubleshooting and optimization of large (1000s of nodes) GKE clusters

Senior Deep Learning Software Engineer, Inference and Model Optimization

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA is at the forefront of the generative AI revolution! The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from neural architecture search and pruning to sparsity, quantization, and automated deployment strategies. Our work includes conducting applied research to improve model efficiency as well as developing an innovative software pl

Principal Site Reliability Engineer, AI Infrastructure

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you! NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for over 30 years. It's an outstanding legacy of innovation that's fueled by phenomenal technology and exceptional people. Today, we're tapping into the unlimited potenti

Senior Software Engineer, Machine Learning Inference

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

At NVIDIA, we're at the forefront of innovation, driving advancements in AI and machine learning to solve some of the world's most challenging problems. We're seeking talented and motivated engineers to join our TensorRT team in developing the industry-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer in the TensorRT team, you will be responsible for designing and implementing inference software optimizations to power AI applications on NVIDIA GPU

Senior Deep Learning Software Engineer, LLM Performance

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing and improving the performance of LLM inference! NVIDIA is rapidly growing our research and development for Deep Learning Inference and is seeking excellent Software Engineers at all levels of expertise to join our team. Companies around the world are using NVIDIA GPUs to power a revolution in deep learning, enabling breakthroughs in

Principal Deep Learning Software Engineer, LLM Performance

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

We are now looking for a Principal Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing and improving the performance of LLM inference! NVIDIA is rapidly growing our research and development for Deep Learning Inference and is seeking excellent Software Engineers at all levels of expertise to join our team. Companies around the world are using NVIDIA GPUs to power a revolution in deep learning, enabling breakthroughs

Data Governance Technology Lead - C13 - Tampa/Jax

Citi

Remote or Tampa, Florida, USA

Full-time

Successful candidate will be responsible for liaising between Product and Technology regarding data governance implementation for a respective Data Domain, activities to include identifying lineage, critical data elements, authoritative data sources, data quality rules, and investigating any data-related issues. The overall objective of this role is to contribute to continuous iterative exploration and investigation of attribute-level data lineage, critical data element decomposition, applicatio

Google Cloud Platform Infra SCE

Edge Global

Florida, USA

Part-time, Contract, Third Party

Greetings, We have the requirements below with our client. Kindly go through the JD below and let me know your interest Title: Google Cloud Platform Infra SCE Location: Remote(PST) Job Type: Contract Mandatory Skill: GPU / TPU Experience Terraform GKE Networking Storage Python Job description: looking for someone with the below skill sets: GPU / TPU Experience Terraform GKE Networking Storage Python Library familiarity with: numpy/pandas/Pytorch/JAX, including optimization Experience with Nv

ML Data Engineer - Healthcare Data Curation & Cleaning (1 Year Fixed Term)

Stanford University

Stanford, California, USA

Full-time

Stanford University is seeking a Big Data Architect 1 for a 1 year fixed term (possibility of renewal) to design and develop applications, test and build automation tools and support the development of Big Data architecture and analytical solutions. About Us: The Department of Biomedical Data Science merges the disciplines of biomedical informatics, biostatistics, computer science and advances in AI. The intersection of these disciplines is applied to precision health, leveraging data across th

Google Cloud Platform Cloud Platform Automation Engineer (with AI/ML focus)

Talent Groups

Remote

Third Party, Contract

Required Qualifications Cloud Architect with 10+ years of experience in managing Cloud infrastructure,migrations, automation. Multi-cloud experience with hands-on experience in the following services: GPU / TPU Experience Familiarity with hands on IaC is a must Terraform Container & Orchestration: o GKE (Google Cloud Platform) o Docker o Helm o Istio o fluxcd / ArgoCD o Prometheus o Grafana o SysDig / DataDog (monitoring) o ELK/EFS Log Aggregation o NGiNX Ingress Controller / ALB Ingress Control