Browse 13 exciting jobs hiring in Vllm now. Check out companies hiring such as NVIDIA, Zencore, Jobgether in Toledo, Madison, Phoenix.
Senior Architect role to design and implement high-performance AI communication and memory libraries while driving hardware-software co-optimization across GPUs, DPUs, NICs, and interconnects at NVIDIA.
Lead design and delivery of secure, scalable, production-grade AI/ML solutions as Zencore’s Principal Architect, advising clients and shaping cloud-native architectures.
Lead performance and scalability improvements for LLM inference by optimizing runtime components, multi-GPU execution, and open-source serving frameworks at scale.
Runpod is seeking a senior Developer Relations & Community Manager to create technical content, run large-scale community operations (Discord primary), and drive developer adoption of its AI infrastructure platform.
Intel is hiring an AI Software Engineer to develop deployment, data, and evaluation infrastructure for agentic AI frameworks and model-serving systems.
Lead the design and delivery of a closed-loop intelligence layer that enables an autonomous trading fleet to learn from real-time outcomes and improve profitability.
Help scale production ML infrastructure and retrieval systems at Foxglove to enable high-performance semantic search and data mining over multimodal robotics data.
Zoox is looking for a Senior Machine Learning Engineer to design and productionize Vision-Language-Action models for real-time scene understanding on its robotaxi platform.
Twelve Labs is hiring a senior Machine Learning Engineer to optimize and scale multimodal video foundation models for deployment across cloud and data platforms.
Fundamental is hiring a Model Serving Engineer to build and optimize production inference infrastructure for NEXUS, focusing on Triton-based pipelines, GPU efficiency, and low-latency, high-throughput serving.
Metamorphic is hiring an ML Research Engineer (Performance Engineering) to implement and optimize GPU kernels, low-precision training, and MoE systems for next-generation foundation models.
Prime Intellect seeks a Research Engineer to build and optimize scalable RL training and orchestration infrastructure that powers frontier agentic models.
Help build the ML platform powering enterprise agentic automation by owning production AI features end-to-end at Sola’s NYC headquarters.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
12
|