Browse 19 exciting GPU Inference jobs hiring now. Check out companies such as Point72, Awesome Motive, and Zoox, hiring in Anchorage, New Orleans, and Chandler.
Point72 is hiring a Machine Learning Infrastructure Engineer to build and operate scalable GenAI infrastructure that accelerates model development and production across cloud and on-prem environments.
Help architect and operate the systems that take neuroscience datasets from raw experiments through large-scale model training, evaluation, and optimized production inference at Metamorphic.
Lead and grow Zoox's ML Platform engineering organization to deliver scalable training and low-latency inference infrastructure for large foundation and RL models across vehicle and cloud environments.
Senior Machine Learning Engineer needed to transform prototype AI models into optimized, production-ready systems for secure, distributed public sector and edge deployments.
Lead performance and scalability improvements for LLM inference by optimizing runtime components, multi-GPU execution, and open-source serving frameworks at scale.
Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.
Twelve Labs is hiring a senior Machine Learning Engineer to optimize and scale multimodal video foundation models for deployment across cloud and data platforms.
Deepgram is hiring an ML Ops Infrastructure Engineer to design and operate scalable model deployment, CI/CD, and monitoring systems that deliver production-grade voice AI at scale.
Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.
Work with research teams to productionize large-scale generative models, build GPU inference infrastructure, and ensure reliable deployment and observability for production ML workloads.
A Research Engineer role focused on GPU/kernel and distributed-training optimizations to scale and accelerate real-time world-model AI.
Lead and build True Anomaly’s AI platform and engineering team to deliver production-grade model hosting, agent infrastructure, and enterprise AI tooling that embed AI across the company.
Fundamental is hiring a Model Serving Engineer to build and optimize production inference infrastructure for NEXUS, focusing on Triton-based pipelines, GPU efficiency, and low-latency, high-throughput serving.
Decagon is hiring a Senior ML Infrastructure Engineer to design and scale distributed training and multi-provider inference platforms for LLMs and multimodal models.
Metamorphic is hiring an ML Research Engineer (Performance Engineering) to implement and optimize GPU kernels, low-precision training, and MoE systems for next-generation foundation models.
Work on training and deploying large-scale ML systems for physical robots while building the infrastructure and pipelines to operate them in production.
Wizard AI is hiring a Senior MLOps Engineer to own and scale the production ML lifecycle for a real-time inference platform behind a conversational shopping agent.
Andromeda Cluster is hiring an Infrastructure Manager to scale global GPU compute supply and demand matching by sourcing suppliers, optimizing utilization, and negotiating commercial terms.
NVIDIA seeks a seasoned Developer Relations Manager to partner with hyperscaler AI teams, provide hands-on technical enablement for NVIDIA AI software, and drive developer adoption and feedback into the product roadmap.