LLM Serving Jobs

Browse 14 exciting jobs hiring in LLM Serving now. Check out companies hiring such as FriendliAI, Zencore, and Intel in Laredo, Tampa, and Cincinnati.

Software Engineer - Senior Backend

FriendliAI Hybrid San Francisco

Posted 3 days ago

Help architect and operate FriendliAI’s enterprise inference platform as a Senior Backend Engineer focused on APIs, multi-tenant SaaS features, and data/system reliability at scale.

Principal Architect, AI/ML

Zencore Hybrid Remote

Posted 4 days ago

Lead design and delivery of secure, scalable, production-grade AI/ML solutions as Zencore’s Principal Architect, advising clients and shaping cloud-native architectures.

AI Software Engineer – Agentic AI System

Intel Hybrid US, California, Santa Clara

Posted 5 days ago

Inclusive & Diverse

Rise from Within

Mission Driven

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Customer-Centric

Snacks

Onsite Gym

Family Coverage (Insurance)

Medical Insurance

Dental Insurance

Vision Insurance

Mental Health Resources

Life insurance

Disability Insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

Learning & Development

Paid Time-Off

401K Matching

Maternity Leave

Paternity Leave

Intel is hiring an AI Software Engineer to develop deployment, data, and evaluation infrastructure for agentic AI frameworks and model-serving systems.

Manager, ML & AI — AI Platform

Zapier Hybrid San Francisco

Posted 8 days ago

Inclusive & Diverse

Rise from Within

Mission Driven

Diversity of Opinions

Work/Life Harmony

Lead Zapier's AI Platform team to build reusable model-serving, evaluation, and MLOps tooling that helps product teams ship AI features quickly, safely, and cost-effectively.

Sr. Staff Software Engineer, Systems Infrastructure

LinkedIn Hybrid Mountain View, CA

Posted 10 days ago

Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.

Staff AI Engineer

MLabs Hybrid No location specified

Posted 13 days ago

Lead the design and delivery of a closed-loop intelligence layer that enables an autonomous trading fleet to learn from real-time outcomes and improve profitability.

Senior Engineer - Platform

Straia Hybrid San Francisco

Posted 14 days ago

Straia seeks a Senior Platform Engineer to design and operate the data movement, model-serving, and platform infrastructure that powers low-latency AI analytics for higher education.

AI Engineering Intern

Actian Corporation Hybrid US-Remote

Posted 16 days ago

Actian is hiring an AI Engineering Intern to help integrate ML models into production applications while gaining hands-on experience with model serving, data pipelines, and full-stack development.

Principal Software Engineer, AI & Matching

Bumble Inc. Hybrid US TX Austin

Posted 20 days ago

Lead the design and scaling of Bumble’s matching, recommendation, and agentic AI systems to deliver low-latency, ML-powered experiences across the product.

Senior Software Engineer, AI Frameworks

NVIDIA Hybrid US, CA, Santa Clara

Posted 22 days ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

Drive production-quality integrations of NVIDIA Grove into Dynamo and leading open-source AI frameworks, delivering adapters, runtime components, and developer tooling for scalable training and inference.

QA Engineer

FriendliAI Hybrid San Francisco

Posted 24 days ago

Shape and own the QA strategy for FriendliAI’s inference platform, covering backend, frontend, model deployments, and novel validation for LLM inference quality.

Software Engineer – Senior Backend

FriendliAI Hybrid San Francisco

Posted 24 days ago

FriendliAI needs a Senior Backend Engineer to design and operate production-grade APIs and backend systems for a fast-moving AI inference platform serving enterprise deployments.

Senior Machine Learning Engineer

Toyota Research Institute Hybrid Los Altos, CA

Posted 24 days ago

Toyota Research Institute is hiring a Senior Machine Learning Engineer to build ML infrastructure, integrate and fine-tune LLMs, and operationalize multimodal research workflows for robotics, autonomy, energy, and materials programs.

Senior Software Engineer, ML Infrastructure

Decagon Hybrid San Francisco

Posted 25 days ago

Decagon is hiring a Senior ML Infrastructure Engineer to design and scale distributed training and multi-provider inference platforms for LLMs and multimodal models.
