GeForce NOW is Nvidia’s Cloud Gaming service, streaming games at the highest quality to any and every user, regardless of their device types and capabilities—low-end PCs, Macs, or mobile devices. Using the most advanced GPUs and Nvidia proprietary software, GeForce NOW transforms the gaming experience with always up-to-date games on always the latest hardware, a streaming experience rivaling that of a local PC, and near-instant launch—just click and play! For more details, see http://www.geforce.com/geforce-now
We are looking for a Senior System Software Engineer who sees the big picture of Cloud Computing and is deeply technical, creative, and hands-on. In this role, you are required to leverage a deep understanding of programming languages, distributed systems, multithreading, cloud services, and system software to design, build, and deploy system services that run in the GeForce NOW cloud. Your work will craft scalable and efficient cloud services to drive Visual Computing, Deep Learning, and Artificial Intelligence. We are looking for an experienced engineer to architect and deploy the production-grade AI agents that power the future of our e-commerce platform. This is a high-impact, hands-on role in which you will build autonomous systems to solve complex challenges in personalization, logistics, and customer experience. Beyond writing code, you will serve as a technical mentor, helping our broader engineering team master modern AI development practices.
What you will be doing:
Architecting Autonomous Systems: Design and optimize production-ready AI agents using frameworks like LangChain, LangGraph, and CrewAI to handle complex, multi-step e-commerce workflows.
Enhancing Customer Experience: Build intelligent conversational agents for order management, personalized shopping assistance, and automated support.
Optimizing Operations: Develop agents focused on inventory forecasting, supply chain planning, and dynamic pricing models to drive business efficiency.
Securing the Platform: Create real-time monitoring systems for fraud detection and risk assessment to protect transactions and user data.
Building Data Pipelines: Construct robust RAG (Retrieval-Augmented Generation) pipelines and sophisticated memory management systems to provide agents with accurate, real-time context.
Ensuring System Reliability: Implement rigorous testing, hallucination mitigation, and human-in-the-loop mechanisms to make AI behavior predictable and safe.
Integrating Technical Stacks: Connect AI agents to our core backend services, REST APIs, and payment gateways to deliver a seamless end-to-end user experience.
Mentoring the Team: Lead workshops, conduct pair-programming sessions, and develop reference architectures to help fellow engineers transition into confident AI contributors.
What we need to see:
Solid Engineering Foundation: A Bachelor’s or Master’s degree in Computer Science or a related technical field or equivalent experience.
Proven Track Record: At least 8+ years of professional software engineering experience, including 3+ years specifically building and deploying production-level LLM applications.
Language Mastery: Expert proficiency in Python, Java, and GoLang for building high-performance backend services.
Framework Expertise: Hands-on experience with LangChain and LangGraph is required; familiarity with AutoGen or LlamaIndex is highly valued.
LLM Proficiency: Deep understanding of model integration (OpenAI, Anthropic, Llama), including prompt engineering, tool-calling, and ReAct patterns.
Infrastructure Skills: Practical experience with AWS, Docker, Kubernetes, and SQL/NoSQL databases to manage scalable, containerized applications.
MLOps Knowledge: A strong grasp of CI/CD pipelines, versioning for AI components, and observability tools for monitoring non-deterministic systems.
Technical Communication: The ability to explain complex architectural decisions and AI concepts to both technical peers and business stakeholders.
Ways to stand out from the crowd:
E-commerce Background: Previous experience building features for high-traffic consumer platforms or large-scale retail environments.
Advanced AI Tooling: Practical familiarity with the Nemo Agent Toolkit or contributing to open-source agentic AI projects.
Safety & Ethics: A background in implementing AI guardrails, bias mitigation, and security hardening for autonomous systems.
Performance Optimization: Experience specifically building high-performance microservices designed to support heavy AI workloads.
With competitive salaries and a generous benefits package, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!
You will also be eligible for equity and benefits.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Senior Architect role to design and implement high-performance AI communication and memory libraries while driving hardware-software co-optimization across GPUs, DPUs, NICs, and interconnects at NVIDIA.
Contribute to Isaac Lab as a Software Engineering Intern focused on building scalable simulation, perception-in-the-loop RL, and sim-to-real capabilities for robot learning at NVIDIA.
Lead application and cloud security for a fast-growing AI EdTech platform, embedding with engineering teams to build secure-by-default systems and developer-friendly security workflows.
Zoox is hiring a skilled C++ software engineer to design and maintain high-performance, safety-critical drivers for lidar, radar, and camera sensors that feed the autonomous driving stack.
Senior Software Engineer needed to develop high-performance, mission-critical software and algorithms for Anduril’s autonomy and sensor-fusion systems.
Lead on-prem and cloud deployments of a cutting-edge AI platform for semiconductor and electronics customers as a Senior Software Engineer based in the Bay Area.
NVIDIA's NVHPC compilers & tools group seeks a Senior HPC Performance Engineer to analyze and optimize high-performance applications across CPU and GPU architectures and guide compiler and application engineering improvements.
NBCUniversal is hiring part-time Academic Year Software Engineering interns in Stamford, CT to support observability, automation, and monitoring efforts within its Operations & Technology division.
The Real Deal seeks a Full Stack Developer to build scalable, data-driven web applications and intuitive user experiences for its high-traffic real estate products.
Graphite is seeking a Senior Frontend Engineer to lead the frontend architecture and help build a real-time, collaborative code review platform that accelerates developer velocity.
Evaluate and optimize real-world AI workloads on emerging hardware platforms to bridge the gap between expected and observed system performance for OpenAI’s infrastructure.
Lead and mentor cloud-focused engineering teams to deliver scalable, production-ready systems that expand access to technology-enabled pediatric care.
Experienced Angular frontend developer needed to implement accessible, component-driven web interfaces for a federal modernization program and collaborate with UX, backend, and product teams.
Experienced backend-focused full-stack engineer to build and maintain scalable Ruby on Rails services integrated with React and GraphQL for a healthcare data intelligence platform.
Autodesk's Enterprise Data Management team is hiring an early-career Software Engineer to build backend systems and data features that ensure reliable customer data and insights.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
71 jobs