About Polymath
Polymath is an applied research lab focused on advancing long-horizon agent capabilities through reinforcement learning. We design and scale simulation environments where agents learn to operate safely and autonomously. We work with the world’s leading model labs to push the frontier of agent capabilities. Polymath is backed by Base10, Founders Future, Y Combinator, and other incredible investors & angels. We've raised an $8M seed, and are actively growing out the team.
About the role
We’re hiring Software Engineers to build the simulation environments, tasks, and verifiers that challenge frontier models. You’ll help create the training and evaluation grounds that make it possible to measure and improve autonomous agents on realistic, challenging work. This is a contract-based role with the opportunity to transition into a full-time position.
Examples of projects you could work on include:
Building diverse, high-fidelity environments that test agents in realistic settings
Designing complex tasks that require long-horizon reasoning and tool use
Developing robust verifiers that reliably measure agent performance
Improving infrastructure and tooling to run, debug, and improve environments
Working closely with the research team to identify failure modes and turn them into new tasks and benchmarks
Have strong engineering fundamentals
Enjoy building from first principles and solving open-ended technical problems
Have high agency and a strong bias toward shipping
Have a high quality bar and care about building robust systems
Culture:
Polymath is a team of researchers, engineers, and operators focused on advancing the frontier of safe, superintelligent AI agents.
We have a flat organizational structure. We believe that people do their best work when they’re self-motivated and driven by a desire to learn, contribute to the team’s goals, and advance scientific progress.
We’re looking for folks who ship fast, set high standards for themselves, and are great team players.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Help build TierZero's core product as a founding engineer, designing agentic LLM systems, ML pipelines, and scalable infrastructure to accelerate how teams run code in production.
Graphite is hiring a Software Engineer to help architect and build a real-time collaborative code review platform while shaping the company’s technical direction in NYC.
Superhuman seeks a Full-Stack Software Engineer to deliver scalable back-end services and rich front-end experiences as part of a hybrid engineering team empowering millions of users.
Ironclad is hiring an AI-native GTM Engineer to architect and deploy autonomous agent systems and integrations that automate end-to-end marketing workflows and drive measurable revenue impact.
Lead application and cloud security for a fast-growing AI EdTech platform, embedding with engineering teams to build secure-by-default systems and developer-friendly security workflows.
Experienced Java/J2EE developer needed to lead enhancements for a retail e‑commerce core platform, with Oracle and ATG experience strongly preferred.
Evaluate and optimize real-world AI workloads on emerging hardware platforms to bridge the gap between expected and observed system performance for OpenAI’s infrastructure.
Experienced C++ engineers are needed to evaluate, repair, and improve AI-generated code as contractor contributors to an RLHF pipeline.
Senior Director responsible for leading application engineering and productionization to deliver enterprise-grade AI/ML and digital applications at scale for Pfizer's AI Acceleration organization.
Work with Vendelux's Product Engineering team to build user-facing full-stack features and gain hands-on startup engineering experience in a backend-focused, remote-friendly internship.
Work with customers to co-architect, build, and operate production AI agents using LangChain’s platform and tools.
ConsumerAffairs is hiring an AI-native Software Engineer to design, build, and maintain scalable backend systems and full-stack features across a Django/Python and React codebase while using AI tools as an integral part of the workflow.
Lead Operational Software Deployment and Integration Engineer responsible for on-site mission software deployment, integration, configuration control, and field readiness for Boeing Phantom Works at Beale AFB.