LiteLLM is the world’s most popular AI Gateway used by the largest companies (Adobe, Netflix, NASA, etc.) in the world to give their developers access to LLMs and adjacent services (MCP’s, Vector Stores, etc.).
Companies use LiteLLM Enterprise once they put LiteLLM into production and need enterprise features like Prometheus metrics (production monitoring) and need to give LLM access to a large number of people with SSO (secure sign on) or JWT (JSON Web Tokens).
We are hiring an exceptional engineer to own release infrastructure and release security at LiteLLM. This is an opportunity to join us in-person as an early employee and make a large impact at a high growth start-up. You will own a critical part of the company: making sure we can ship secure, reliable releases on a consistent cadence with a high degree of autonomy and ownership.
We work 5 days per week in our SF office, approximately 60 hours per week in total.
We are looking for a software engineer with a strong background in infrastructure, CI/CD, and release engineering. You should be comfortable working across Helm, Terraform, release automation, testing systems, and the developer infrastructure needed to guarantee stable releases. This is a hands-on role.
You should be able to investigate test failures, distinguish real regressions from flaky tests, write Python, fix minor test issues, remove dead tests, and improve the overall reliability of the release pipeline. You should also be able to architect a secure end-to-end release process: how code moves from commit to published artifact, how access is controlled, how secrets are handled, and how we reduce the chance of bad or unauthorized releases.
Own secure, regular releases for LiteLLM, including 2 nightly releases and 1 stable release, per week.
Manage and improve the infrastructure behind our release process, including Helm, Terraform, CI/CD, and other developer systems needed to keep releases stable.
Investigate test failures and determine whether they are true regressions, flaky tests, or dead tests that should be fixed or removed.
Write Python to fix minor test issues, improve release reliability, and support developer workflows.
Architect and implement a secure release process across build, test, approval, and publish steps.
Work closely with the engineering team to improve release quality, reduce operational risk, and keep shipping velocity high.
2+ years of experience in infrastructure engineering, DevSecOps, release engineering, or related systems work.
Proficient in Python and comfortable making code changes in test and release systems.
Experience with Terraform, Helm, CI/CD systems, and cloud infrastructure.
Strong judgment around release reliability, testing, and debugging.
Ability to distinguish between real regressions and flaky infrastructure or test behavior.
Ability to design secure release processes, including access controls, secrets handling, and safe publishing workflows.
Ability to collaborate effectively with engineers across product, infra, and security.
LiteLLM (https://github.com/BerriAI/litellm) is a Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere] and is used by companies like Rocket Money, Adobe, Twilio, and Siemens.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Senior Angular/Full-Stack Engineer to drive front-end architecture and build provider-facing treatment planning and eligibility UIs at Wellfit, working across Product, Design, and backend teams.
Zoox is hiring a skilled C++ software engineer to design and maintain high-performance, safety-critical drivers for lidar, radar, and camera sensors that feed the autonomous driving stack.
The Real Deal seeks a Full Stack Developer to build scalable, data-driven web applications and intuitive user experiences for its high-traffic real estate products.
Senior Salesforce Developer role at a data analytics and Salesforce consultancy, driving architecture, AI-assisted development, and cross-functional solution delivery in a fully remote environment.
Help build AI-first government software at Kaizen as a Product Software Engineer, delivering high-impact, real-world features used by millions.
Help scale Chime's AI-powered Jade assistant by building platform tooling, backend services, and observability systems as a Senior Full-Stack Engineer.
Senior Software Engineer (Mobile) to lead and deliver high-quality React Native mobile experiences while contributing across Rev’s full-stack platform to accelerate growth and engagement.
Lead the design and delivery of mission-critical, event-driven middleware for a private markets fintech platform while mentoring engineers and shaping backend engineering practices.
Bosch Rexroth is hiring a Summer 2026 Software Engineering Intern to develop C# tools that generate and optimize C++ code for embedded systems in mobile machine applications.
Polygon Labs seeks an AI Developer Experience Engineer to build org-wide AI tooling, agent integrations, and observability that speed AI adoption across a distributed blockchain-focused company.
Greenhouse is hiring a Senior UX Engineer, Design Systems to build reusable, accessible component patterns and documentation that enable product teams to ship faster and more consistently.
Superhuman seeks a Full-Stack Software Engineer to deliver scalable back-end services and rich front-end experiences as part of a hybrid engineering team empowering millions of users.
Senior Director responsible for leading application engineering and productionization to deliver enterprise-grade AI/ML and digital applications at scale for Pfizer's AI Acceleration organization.