Browse 59 exciting jobs hiring in Cuda now. Check out companies hiring such as Point72, OpenAI, NVIDIA in Phoenix, Salt Lake City, Santa Ana.
Point72 is hiring a Machine Learning Infrastructure Engineer to build and operate scalable GenAI infrastructure that accelerates model development and production across cloud and on-prem environments.
Evaluate and optimize real-world AI workloads on emerging hardware platforms to bridge the gap between expected and observed system performance for OpenAI’s infrastructure.
NVIDIA's NVHPC compilers & tools group seeks a Senior HPC Performance Engineer to analyze and optimize high-performance applications across CPU and GPU architectures and guide compiler and application engineering improvements.
NVIDIA is seeking a Solutions Architect to lead OEM-based AI Factory architecture and technical strategy for Federal sovereign AI deployments.
Lead the design and scaling of distributed training infrastructure at Metamorphic to enable large-scale foundation-model experiments across thousands of GPUs.
Senior GPU performance engineer role at Zoox to benchmark, profile, and optimize GPU and CPU workloads to maximize throughput and meet real-time constraints for autonomous vehicle systems.
The University of Chicago's CTDS is hiring a Senior Platform Engineer to lead production support, CI/CD pipelines, monitoring, and security automation across hybrid cloud and on‑prem translational data science platforms.
Work with Meshy's engineering and research teams as a Fullstack Engineer Intern to build scalable full-stack features and AI-powered tooling that reach millions of users.
SHI is hiring an early-career Associate Solutions Engineer – AI to help translate modern ML techniques into scalable, production-ready enterprise solutions.
Senior Machine Learning Engineer needed to transform prototype AI models into optimized, production-ready systems for secure, distributed public sector and edge deployments.
Lead performance and scalability improvements for LLM inference by optimizing runtime components, multi-GPU execution, and open-source serving frameworks at scale.
Lead development of Intel's neuromorphic AI compiler and runtime to enable production-grade, high-performance physical AI applications across hardware and software ecosystems.
Lead developer advocacy for NVIDIA's Newton and Warp toolchains, partnering with industry, academia, and ISVs to drive GPU-accelerated, differentiable simulation adoption across robotics.
Lead the design and delivery of production-ready online sensor calibration algorithms for Toyota’s autonomous driving systems while optimizing for accuracy, robustness, and constrained runtime environments.
Drive production-ready model optimization, custom kernel development, and edge deployment to enable real-time inference of large-scale models on vehicle SOCs for Zoox's Perception team.
Encord is hiring a Machine Learning Engineer to research, adapt, and productionize cutting-edge computer vision and deep learning methods within a fast-growing AI infrastructure startup.
Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.
Lead performance engineering for FSI-focused AI and HPC workloads at NVIDIA, optimizing parallel algorithms and GPU/CPU systems to unlock world-class performance.
Lead the architecture and operation of production-scale GPU clusters at Andromeda, partnering with customers to maximize distributed training reliability and performance.
NVIDIA's ADI team seeks a Senior Software Engineer to design and implement high-performance C++/CUDA libraries for accelerating GPU data processing and contribute to major open-source projects.
Lead the architecture and delivery of real-time RF sensor software at STR, transitioning algorithms to optimized C/C++ implementations and driving open-system integration across distributed platforms.
Lead design and implementation of real-time computer vision and ML algorithms for minimally invasive robotic surgery at a market-leading medical robotics company.
Lead development of ML-based combinatorial optimization and design-space-exploration tools to optimize LLM training and inference across GPU/CPU clusters and high-performance networking at datacenter scale.
Take a leading role developing state-of-the-art visual intelligence models and systems at an ambitious AI research-focused company based in Palo Alto.
ClearEdge is hiring an HPC Software Engineer III to lead development and performance optimization of compute-intensive, parallel/distributed software for high-impact DoD programs.
Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.
NVIDIA is seeking a Senior System Software Engineer to architect and implement CUDA driver features for Windows, advancing GPU computing across AI, graphics, and system workloads.
Applied Research Scientist role to design and implement cutting-edge computer vision and generative models that move research from prototype to production in creative simulation tools.
Lead and scale NVIDIA's embedded AI software go-to-market and partner co‑sales to accelerate ISV, OEM, and system integrator adoption of NVIDIA's platform.
Work with research teams to productionize large-scale generative models, build GPU inference infrastructure, and ensure reliable deployment and observability for production ML workloads.
A Research Engineer role focused on GPU/kernel and distributed-training optimizations to scale and accelerate real-time world-model AI.
Lead the development of production-ready software for SpaceX’s metal 3D printing systems, driving controls, data acquisition, and in-process monitoring to improve printed hardware outcomes.
Work on advanced graph neural network models and 3D reconstruction pipelines to power AI-first generative design and BIM generation at an early-stage startup focused on transforming construction design and estimation.
A systems researcher/engineer role focused on prototyping, benchmarking, and system-level analysis of AI and data-center workloads to drive Intel's next-generation architecture and product decisions.
Lead the architecture and delivery of NVIDIA’s Retail & CPG product platform, blending agentic AI and accelerated computing to enable scalable retail, supply chain, and commerce solutions.
Fundamental is hiring a Model Serving Engineer to build and optimize production inference infrastructure for NEXUS, focusing on Triton-based pipelines, GPU efficiency, and low-latency, high-throughput serving.
Lead developer strategy and partnerships to drive adoption of NVIDIA's core CUDA Math Libraries, with a focus on mixed-precision enablement and high-performance numerical computing.
General Robotics seeks an ML Systems Engineer in Redmond to productionize and optimize real-time, GPU-accelerated model serving and ML infrastructure for autonomous robotics.
Anduril is hiring a Senior FPGA Engineer in Costa Mesa to lead Xilinx-based FPGA design, verification, and bring-up for next-generation software defined radios and EW platforms.
Senior Staff TPM role leading portfolio-level IaaS and GPU-generation programs, shaping NPI frameworks, and coaching TPMs at a rapidly scaling AI infrastructure company.
Drive production-quality integrations of NVIDIA Grove into Dynamo and leading open-source AI frameworks, delivering adapters, runtime components, and developer tooling for scalable training and inference.
KLA is seeking an experienced AI Software Engineer to build and maintain scalable Generative AI and LLM solutions deployed to cloud production environments.
Join vCluster Labs as an AI Infrastructure Specialist to lead technical pre-sales and production deployments of GPU-powered Kubernetes on bare metal, turning early customer engagements into scalable playbooks.
Lumafield is hiring a Senior Embedded Systems Engineer to design and ship high-performance firmware and Linux-based edge software for next-generation CT scanning products in San Francisco.
Lead the development of state estimation and localization algorithms for SandboxAQ’s MagNav team, applying expertise in C++, sensor integration, and navigation theory to novel GNSS-alternative systems.
Toyota Research Institute is hiring a Senior Machine Learning Engineer to build ML infrastructure, integrate and fine-tune LLMs, and operationalize multimodal research workflows for robotics, autonomy, energy, and materials programs.
Work on cutting-edge embedded graphics and interaction software for intraoperative navigation and guidance within a leading surgical-robotics company.
Metamorphic is hiring an ML Research Engineer (Performance Engineering) to implement and optimize GPU kernels, low-precision training, and MoE systems for next-generation foundation models.
Lead NVIDIA’s embedded AI software go-to-market and partner co-sales to drive broad ISV, OEM, and system integrator adoption of NVIDIA AI platforms.
Meshy is hiring an AI 3D Dataset Engineer to design and operate scalable 3D data pipelines, tooling, and quality systems that enable high-performance generative 3D models.
Below 50k*
0
|
50k-100k*
2
|
Over 100k*
57
|