LLM Inference Jobs

Browse 21 jobs now hiring in LLM inference. Companies hiring include Spotify, Jobgether, and Sunday, with openings in Miami, Virginia Beach, and Oceanside.

Data Scientist - Subscriptions

Spotify Hybrid New York, NY

Posted 3 days ago

Inclusive & Diverse

Empathetic

Take Risks

Transparent & Candid

Feedback Forward

Mission Driven

Collaboration over Competition

Work/Life Harmony

Maternity Leave

Paternity Leave

Snacks

Medical Insurance

Dental Insurance

Vision Insurance

Mental Health Resources

Life insurance

401K Matching

Paid Sick Days

Paid Time-Off

Paid Volunteer Time

Drive product decisions for Spotify Premium as a Data Scientist focused on experimentation, AI-enabled analytics, and insights that increase conversion and retention.

Senior Software Engineer - AI Inference

Jobgether Hybrid US

Posted 4 days ago

Lead performance and scalability improvements for LLM inference by optimizing runtime components, multi-GPU execution, and open-source serving frameworks at scale.

Machine Learning Research Engineer/Scientist

Sunday Hybrid Redwood City

Posted 7 days ago

Contribute to state-of-the-art robot learning and on-robot deployment at a fast-moving consumer robotics startup focused on dexterous home manipulation.

Technical Ex-Founder

Awesome Motive Hybrid New York

Posted 8 days ago

Aviator Health seeks a Technical Ex‑Founder to lead 0→1 consumer product development and build autonomous agent systems that navigate real healthcare workflows from our NYC office.

AI Inference Engineer - Model Optimization & Deployment

Zoox Hybrid No location specified

Posted 9 days ago

Drive production-ready model optimization, custom kernel development, and edge deployment to enable real-time inference of large-scale models on vehicle SoCs for Zoox's Perception team.

Sr. Staff Software Engineer, Systems Infrastructure

LinkedIn Hybrid Mountain View, CA

Posted 10 days ago

Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.

AI Research Scientist - GenAI

Bosch Group Hybrid 2555 Smallman St, Pittsburgh, PA 15222, USA

Posted 12 days ago

Lead cutting-edge research on multimodal foundation models and efficient GenAI at Bosch Research Pittsburgh, translating innovations into industrial and product impact while publishing at top-tier venues.

Staff AI Engineer

MLabs Hybrid No location specified

Posted 13 days ago

Lead the design and delivery of a closed-loop intelligence layer that enables an autonomous trading fleet to learn from real-time outcomes and improve profitability.

Senior ML/AI Engineer

Sandbar Hybrid New York City

Posted 16 days ago

Lead the design and deployment of low-latency, production ML systems for voice, audio, and agentic control at an early-stage hardware and software startup in New York City.

Multimodal AI Model Optimization Research Engineer

Tavus Hybrid No location specified

Posted 16 days ago

Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.

Member of Technical Staff

Awesome Motive Hybrid San Francisco

Posted 16 days ago

Work across modeling, systems, and product to design, optimize, and ship production-grade AI systems for real-world users.

Data Scientist

Pluralsight Hybrid Remote

Posted 19 days ago

Pluralsight seeks an experienced Data Scientist to design, validate, and deploy machine learning and NLP solutions that drive product and business impact.

Lead AI/ML Engineer

ASAPP Hybrid No location specified

Posted 23 days ago

Lead the Core GenerativeAgent team to design, build, and deploy low-latency, enterprise-grade conversational voice AI combining LLMs with speech-to-text, text-to-speech, and real-time streaming pipelines.

QA Engineer

FriendliAI Hybrid San Francisco

Posted 24 days ago

Shape and own the QA strategy for FriendliAI’s inference platform, covering backend, frontend, model deployments, and novel validation for LLM inference quality.

Staff AI Engineer, AI Privacy Specialist

LinkedIn Hybrid Sunnyvale, CA

Posted 25 days ago

Senior technical role focused on researching, engineering, and scaling privacy-preserving ML and LLM alignment solutions across LinkedIn's platforms.

Senior Software Engineer, ML Infrastructure

Decagon Hybrid San Francisco

Posted 25 days ago

Decagon is hiring a Senior ML Infrastructure Engineer to design and scale distributed training and multi-provider inference platforms for LLMs and multimodal models.

Software Engineer – Python Developer Tools

FriendliAI Hybrid San Francisco

Posted 25 days ago

Work on FriendliAI's core developer experience by owning the Python SDK and CLI, packaging pipelines, and internal dev tools that enable reliable integrations with our inference and agent platform.

AI Engineer

Varick Agents Hybrid No location specified

Posted 27 days ago

Varick seeks an AI Engineer to architect and ship production-grade agent systems, evaluation pipelines, and retrieval-driven context strategies for enterprise AI deployments.

Developer Relations Manager – AI Natives

NVIDIA Hybrid US, CA, Santa Clara

Posted 28 days ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

Drive adoption of NVIDIA accelerated computing by advising AI-native startups on architecture, optimization, and scaling of agentic, multimodal, and LLM-powered applications.

Software Engineer - Model Developer Ecosystem

Baseten Hybrid San Francisco

Posted 30 days ago

Help shape Baseten's model ecosystem by combining hands-on engineering, developer education, and product thinking to improve model discovery, evaluation, and adoption.

Senior Staff AI Engineer, AI Privacy Expert

LinkedIn Hybrid Mountain View, CA

Posted last month

Lead research and productionization of privacy-preserving ML (differential privacy, federated learning, secure computation) and LLM alignment across LinkedIn’s AI platforms as a Senior Staff AI Engineer.