An applied research team within NVIDIA’s Networking Systems & Software Architecture group is solving some of AI’s hardest infrastructure problems. The team builds systems-level software that moves data between GPUs, nodes, and storage at the speed modern AI demands—spanning low-level transport optimization, hardware-software co-design, and communication frameworks that plug directly into production AI stacks. The team's charter expands into emerging domains including quantum computing interconnects.
The Senior Architect role is to own modules and projects end-to-end—from scoping research questions to shipping production code. It calls for a recognized expert who drives technical decisions, pulls in ideas from research and industry, and regularly prototypes new approaches to prove a point. The work lives at the boundary of applied research and production engineering!
What you will be doing:
Architecting and implementing high-performance communication and memory management libraries for distributed AI
Driving hardware-software co-optimization with GPU, DPU, NIC, and switch teams through GPUDirect RDMA, NVLink, and next-generation interconnects
Profiling and optimizing data movement across GPU memory, system DRAM, NVMe, and network fabrics
Integrating networking capabilities into AI serving stacks such as vLLM, SGLang, and TensorRT-LLM
Contributing to and maintaining open-source projects, mentoring engineers, conducting design reviews, and prototyping experimental technologies to evaluate their viability
What we need to see:
8+ years in systems software and/or networking with demonstrated ownership of complex projects.
MS, PhD or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering, or a related field.
Solid understanding of high-performance networking: InfiniBand, RoCE, RDMA, NVLink, GPUDirect.
Strong C/C++/Rust systems programming with comfort in performance profiling and low-level debugging.
Understanding of ML systems concepts—transformer architectures, KV cache mechanics, model parallelism, or distributed training and inference patterns.
Ways to stand out from the crowd:
Knowledge of ML inference frameworks (vLLM, SGLang, TensorRT-LLM) and their communication requirements.
Knowledge of storage networking (NVMe-oF, GPUDirect Storage, S3).
Background in reinforcement learning systems.
With competitive salaries and a comprehensive benefits package, NVIDIA is widely regarded as one of the most desirable technology employers in the world. Our teams are composed of some of the most forward‑thinking and driven engineers in the industry, and we continue to grow rapidly. If you are a senior data engineer passionate about building large‑scale, high‑impact data platforms, we’d love to hear from you.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5. You will also be eligible for equity and benefits.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.