Rise Jobs & Careers icon Llm Inference Jobs

Browse 21 exciting jobs hiring in Llm Inference now. Check out companies hiring such as Spotify, Jobgether, Sunday in Miami, Virginia Beach, Oceanside.

Photo of the Rise User
Posted 3 days ago
Inclusive & Diverse
Empathetic
Take Risks
Transparent & Candid
Feedback Forward
Mission Driven
Collaboration over Competition
Work/Life Harmony
Maternity Leave
Paternity Leave
Snacks
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
401K Matching
Paid Sick Days
Paid Time-Off
Paid Volunteer Time

Drive product decisions for Spotify Premium as a Data Scientist focused on experimentation, AI-enabled analytics, and insights that increase conversion and retention.

Photo of the Rise User

Lead performance and scalability improvements for LLM inference by optimizing runtime components, multi-GPU execution, and open-source serving frameworks at scale.

Photo of the Rise User

Contribute to state-of-the-art robot learning and on-robot deployment at a fast-moving consumer robotics startup focused on dexterous home manipulation.

Photo of the Rise User
Posted 8 days ago

Aviator Health seeks a Technical Ex‑Founder to lead 0→1 consumer product development and build autonomous agent systems that navigate real healthcare workflows from our NYC office.

Photo of the Rise User

Drive production-ready model optimization, custom kernel development, and edge deployment to enable real-time inference of large-scale models on vehicle SOCs for Zoox's Perception team.

Photo of the Rise User

Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.

Photo of the Rise User
Bosch Group Hybrid 2555 Smallman St, Pittsburgh, PA 15222, USA
Posted 12 days ago

Lead cutting-edge research on multimodal foundation models and efficient GenAI at Bosch Research Pittsburgh, translating innovations into industrial and product impact while publishing at top-tier venues.

MLabs Hybrid No location specified
Posted 13 days ago

Lead the design and delivery of a closed-loop intelligence layer that enables an autonomous trading fleet to learn from real-time outcomes and improve profitability.

Posted 16 days ago

Lead the design and deployment of low-latency, production ML systems for voice, audio, and agentic control at an early-stage hardware and software startup in New York City.

Photo of the Rise User
Posted 16 days ago

Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.

Photo of the Rise User
Posted 16 days ago

Work across modeling, systems, and product to design, optimize, and ship production-grade AI systems for real-world users.

Photo of the Rise User
Posted 19 days ago

Pluralsight seeks an experienced Data Scientist to design, validate, and deploy machine learning and NLP solutions that drive product and business impact.

Photo of the Rise User
ASAPP Hybrid No location specified
Posted 23 days ago

Lead the Core GenerativeAgent team to design, build, and deploy low-latency, enterprise-grade conversational voice AI combining LLMs with speech-to-text, text-to-speech, and real-time streaming pipelines.

FriendliAI Hybrid San Francisco
Posted 24 days ago

Shape and own the QA strategy for FriendliAI’s inference platform, covering backend, frontend, model deployments, and novel validation for LLM inference quality.

Photo of the Rise User

Senior technical role focused on researching, engineering, and scaling privacy-preserving ML and LLM alignment solutions across LinkedIn's platforms.

Photo of the Rise User

Decagon is hiring a Senior ML Infrastructure Engineer to design and scale distributed training and multi-provider inference platforms for LLMs and multimodal models.

Work on FriendliAI's core developer experience by owning the Python SDK and CLI, packaging pipelines, and internal dev tools that enable reliable integrations with our inference and agent platform.

Varick Agents Hybrid No location specified
Posted 27 days ago

Varick seeks an AI Engineer to architect and ship production-grade agent systems, evaluation pipelines, and retrieval-driven context strategies for enterprise AI deployments.

Photo of the Rise User
Posted 28 days ago
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave

Drive adoption of NVIDIA accelerated computing by advising AI-native startups on architecture, optimization, and scaling of agentic, multimodal, and LLM-powered applications.

Photo of the Rise User

Help shape Baseten's model ecosystem by combining hands-on engineering, developer education, and product thinking to improve model discovery, evaluation, and adoption.

Photo of the Rise User
Posted last month

Senior Staff AI Engineer to lead research and productionization of privacy-preserving ML (differential privacy, federated learning, secure computation) and LLM alignment across LinkedIn’s AI platforms.

Employment type
Remote/Onsite
Application Type
Date Posted
Department
Work Experience
Industries
Skills
Company size
Funding
Company Culture
Benefits & Perks
Company Rating
Salary (USD)
Keywords to Exclude

How much do llm inference jobs pay?

Below 50k*
0
0%
50k-100k*
0
0%
Over 100k*
1
100%
*average yearly salary (USD)

Top companies hiring for llm inference jobs

Best cities to find llm inference jobs