Concentrate provides one OpenAI-compatible API to access, route, and manage models across leading AI providers and open-source models through a single endpoint. We help teams save time, lower token spend with credits back from our bulk purchasing power, improve reliability, and avoid vendor lock-in.
Supported by top-tier VCs. This is a remote role.
You'll work directly with customers to solve LLM infrastructure and deployment problems, while also building the product and platform capabilities that make those solutions scalable. This is a highly hands-on role for someone who is technical, pragmatic, and excited to operate across customer work, engineering, and product at an early-stage AI API company.
What You'll Do
Work closely with customers to understand LLM deployment needs and solve technical problems in production
Debug issues end to end across application behavior, AI API integrations, infrastructure, and model and provider performance across OpenAI, Anthropic, Gemini, and open source models
Build product features, internal tools, and platform improvements based on patterns you see in the field
Improve multi-provider routing, LLM reliability, AI observability, latency, and token cost efficiency across multiple LLM providers
Help customers reduce AI infrastructure costs, navigate rate limits, and architect for provider failover and redundancy
Partner closely with founders on customer deployments, product direction, and technical strategy
What We're Looking For
Strong technical ability and high ownership
Strong debugging instincts across backend systems, AI APIs, infrastructure, and customer environments
Experience working with or around LLM APIs, model routing, or AI spend management is a strong plus
Comfort working directly with customers and operating in ambiguity
Startup experience or experience in fast-moving, high-ownership environments
Likely 5–12 years of experience, with flexibility for exceptional candidates
Experience with some of: Python, TypeScript/Node.js, PostgreSQL, Redis, AWS, Docker, Kubernetes, Terraform, and CI/CD workflows
Clear written and verbal communication skills
Fluent English required
Bonus
Experience with LLM gateways, AI gateway architecture, or enterprise AI infrastructure
Familiarity with zero data retention, PII redaction, or AI compliance requirements
Experience with LLM cost optimization, token spend analysis, or provider discount structures
Experience in forward deployed, solutions, or customer-facing technical roles
Founder or early startup experience
Interest in growing into broader technical leadership over time
Salary Range: $200K-$300K cash compensation + strong equity
Equal Opportunity & Fair Chance Notice
Concentrate AI is an affirmative action and equal opportunity employer. We are committed to providing equal employment opportunities and do not discriminate in recruiting, hiring, training, promotion, or other employment practices on the basis of race, color, sex, age, religion, national origin, ancestry, protected veteran status, disability, sexual orientation, gender identity or expression, genetic information, or any other status protected by applicable law.
Qualified applicants with arrest and conviction records will be considered in accordance with the San Francisco Fair Chance Ordinance and applicable state and local laws.
California Privacy Notice
California residents may contact privacy@concentrate.ai for additional information regarding how we collect, use, and disclose personal information during the job application process.
Recruitment Agency Notice
Concentrate AI does not accept unsolicited resumes from recruitment agencies and is not responsible for any fees related to unsolicited submissions.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Join Patreon's Identity & Access team to design and implement authentication, verification, and account-protection features that keep creators and their supporters safe and secure.
Help scale Chime's AI-powered Jade assistant by building platform tooling, backend services, and observability systems as a Senior Full-Stack Engineer.
Lead the design and delivery of mission-critical, event-driven middleware for a private markets fintech platform while mentoring engineers and shaping backend engineering practices.
Lead the architecture, development, and stabilization of Cardinal Health's cloud-native eCommerce platforms while guiding distributed engineering teams and driving modernization efforts.
Lead the development of scalable backend systems and CV-driven features for a fast-moving youth-sports platform, shaping automated highlights and video analytics used by millions.
Wellmark is hiring a Software Engineer to design and build data-focused integrations and pipelines that support HEDIS and quality measurement in a regulated healthcare environment.
Experienced Java Technical Lead/Architect needed to provide hands-on architecture, design reviews, and leadership for large-scale enterprise systems in Santa Clara.
NBC News is hiring Academic Year interns in New York across product, design, data/graphics, mobile development, and software engineering to contribute to real projects while earning $30/hour.
Constellation Technologies is hiring a TS/SCI-cleared AI Software Engineer to lead LLM orchestration, data engineering, and secure deployment efforts for mission-critical systems.
Evaluate and optimize real-world AI workloads on emerging hardware platforms to bridge the gap between expected and observed system performance for OpenAI’s infrastructure.
Work directly with the founder to harden rapid AI-driven prototypes into battle-tested, frontend-forward foundations for an early-stage precision medicine platform.
Lead Android core product development at Speechify to deliver high-quality, user-focused features for millions of learners using Kotlin and modern Android architecture.
Academic Year internship at NBCUniversal's Universal Pictures Content Group focused on full-stack and AR/VR development, machine learning experimentation, and digital transformation projects.