Inference Optimization Jobs

Browse 17 exciting jobs hiring in Inference Optimization now. Check out companies hiring such as Hinge Health, Awesome Motive, webAI in Kansas City, Milwaukee, Honolulu.

VIEW COMPANIES

Staff Data Scientist

Hinge Health Hybrid San Francisco

VIEW

Posted 6 hours ago

Lead the measurement, experimentation, and data architecture for HingeSelect as the first dedicated Staff Product Data Scientist driving causal analysis, funnel optimization, and supply-demand modeling.

ML Research Engineer (Model Training)

Awesome Motive Hybrid Palo Alto

VIEW

Posted 11 hours ago

Help architect and operate the systems that take neuroscience datasets from raw experiments through large-scale model training, evaluation, and optimized production inference at Metamorphic.

Senior Machine Learning Engineer

webAI Hybrid No location specified

VIEW

Posted 3 days ago

Senior Machine Learning Engineer needed to transform prototype AI models into optimized, production-ready systems for secure, distributed public sector and edge deployments.

Senior Software Engineer - AI Inference

Jobgether Hybrid US

VIEW

Posted 4 days ago

Lead performance and scalability improvements for LLM inference by optimizing runtime components, multi-GPU execution, and open-source serving frameworks at scale.

AI Inference Engineer - Model Optimization & Deployment

Zoox Hybrid No location specified

VIEW

Posted 8 days ago

Drive production-ready model optimization, custom kernel development, and edge deployment to enable real-time inference of large-scale models on vehicle SOCs for Zoox's Perception team.

Sr. Staff Software Engineer, Systems Infrastructure

LinkedIn Hybrid Mountain View, CA

VIEW

Posted 10 days ago

Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.

Staff AI Engineer

MLabs Hybrid No location specified

VIEW

Posted 13 days ago

Lead the design and delivery of a closed-loop intelligence layer that enables an autonomous trading fleet to learn from real-time outcomes and improve profitability.

Machine Learning Engineer, Platform Integrations

TwelveLabs Hybrid San Francisco

VIEW

Posted 14 days ago

Twelve Labs is hiring a senior Machine Learning Engineer to optimize and scale multimodal video foundation models for deployment across cloud and data platforms.

Senior ML/AI Engineer

Sandbar Hybrid New York City

VIEW

Posted 15 days ago

Lead the design and deployment of low-latency, production ML systems for voice, audio, and agentic control at an early-stage hardware and software startup in New York City.

Multimodal AI Model Optimization Research Engineer

Tavus Hybrid No location specified

VIEW

Posted 16 days ago

Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.

Member of Technical Staff

Awesome Motive Hybrid San Francisco

VIEW

Posted 16 days ago

Work across modeling, systems, and product to design, optimize, and ship production-grade AI systems for real-world users.

Data Scientist - Model Optimization

Quadric, Inc Hybrid No location specified

VIEW

Posted 18 days ago

Lead the development of custom quantization algorithms and low-precision techniques to maximize model performance on Quadric's Chimera GPNPU from our Burlingame engineering office.

Senior Software Engineer, ML Infrastructure

Decagon Hybrid San Francisco

VIEW

Posted 24 days ago

Decagon is hiring a Senior ML Infrastructure Engineer to design and scale distributed training and multi-provider inference platforms for LLMs and multimodal models.

ML Research Engineer (Performance Engineering)

Awesome Motive Hybrid Palo Alto

VIEW

Posted 25 days ago

Metamorphic is hiring an ML Research Engineer (Performance Engineering) to implement and optimize GPU kernels, low-precision training, and MoE systems for next-generation foundation models.

Senior ML Ops Engineer

Wizard Hybrid Remote - USA

VIEW

Posted 26 days ago

Wizard AI is hiring a Senior MLOps Engineer to own and scale the production ML lifecycle for a real-time inference platform behind a conversational shopping agent.

AI Engineer

Varick Agents Hybrid No location specified

VIEW

Posted 27 days ago

Varick seeks an AI Engineer to architect and ship production-grade agent systems, evaluation pipelines, and retrieval-driven context strategies for enterprise AI deployments.

Developer Advocate Engineer

Dexmate Hybrid Santa Clara

VIEW

Posted 29 days ago

Lead developer-facing content and sample projects that help ML engineers train, fine-tune, and deploy models on Dexmate humanoid robots while shipping production-quality code weekly.

Employment type

Remote/Onsite

Application Type

Date Posted

Department

Work Experience

Industries

Skills

Company size

Funding

Company Culture

Benefits & Perks