Browse 17 exciting jobs hiring in Tensorrt now. Check out companies hiring such as NVIDIA, Zoox, Shi in Springfield, Fort Wayne, Virginia Beach.
Senior Architect role to design and implement high-performance AI communication and memory libraries while driving hardware-software co-optimization across GPUs, DPUs, NICs, and interconnects at NVIDIA.
NVIDIA is seeking a Solutions Architect to lead OEM-based AI Factory architecture and technical strategy for Federal sovereign AI deployments.
Lead and grow Zoox's ML Platform engineering organization to deliver scalable training and low-latency inference infrastructure for large foundation and RL models across vehicle and cloud environments.
Senior GPU performance engineer role at Zoox to benchmark, profile, and optimize GPU and CPU workloads to maximize throughput and meet real-time constraints for autonomous vehicle systems.
SHI is hiring an early-career Associate Solutions Engineer – AI to help translate modern ML techniques into scalable, production-ready enterprise solutions.
Drive production-ready model optimization, custom kernel development, and edge deployment to enable real-time inference of large-scale models on vehicle SOCs for Zoox's Perception team.
Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.
Lead performance engineering for FSI-focused AI and HPC workloads at NVIDIA, optimizing parallel algorithms and GPU/CPU systems to unlock world-class performance.
NVIDIA is hiring a Principal Software Engineer to lead architecture, reliability, and production hardening of enterprise agentic AI applications and shared platform services.
Deepgram is hiring an ML Ops Infrastructure Engineer to design and operate scalable model deployment, CI/CD, and monitoring systems that deliver production-grade voice AI at scale.
Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.
Fundamental is hiring a Model Serving Engineer to build and optimize production inference infrastructure for NEXUS, focusing on Triton-based pipelines, GPU efficiency, and low-latency, high-throughput serving.
Lead and grow a high-performing edge software engineering team to build and scale AI-enabled IoT solutions deployed across distributed devices for a fast-growing intelligent site technology company.
Wyetech is seeking an experienced Software Engineer 2 to productionize ML research into high-performance, containerized systems for federal customers while working hybrid from Laurel, MD.
Metamorphic is hiring an ML Research Engineer (Performance Engineering) to implement and optimize GPU kernels, low-precision training, and MoE systems for next-generation foundation models.
Help build the ML platform powering enterprise agentic automation by owning production AI features end-to-end at Sola’s NYC headquarters.
Drive adoption of NVIDIA accelerated computing by advising AI-native startups on architecture, optimization, and scaling of agentic, multimodal, and LLM-powered applications.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
13
|