Job details

Staff Software Engineer - Supernal

Staff Software Engineer

About Supernal

Supernal helps small-to-medium businesses hire their first AI employee. Our AI teammates are built using intelligent, agentic workflows deployed on a proprietary platform. We deliver working, value-generating AI Employees—not tools—that handle real business processes alongside human teams.

The Role

We're looking for a Staff/Principal Software Engineer to own and evolve the core platform that powers our AI employees. This is a technical leadership position responsible for the systems that enable our agents to scale reliably: the Django backend, distributed task infrastructure, event-driven architecture, Kubernetes deployments, and observability stack.

You'll work across the full system—from database query optimization to Helm chart tuning to designing new platform abstractions. You'll be a force multiplier for the engineering team, driving architectural decisions, eliminating scaling bottlenecks, and establishing patterns that make the platform more robust and developer-friendly.

This role reports to the Director of Engineering and involves significant autonomy in shaping technical direction.

What You'll Own

Drive platform architecture decisions and align the team on scalable patterns and long-term maintainability
Review a high volume of code, design docs, and architectural proposals for scalability, reliability, security, and operability
Be a technical mentor and force multiplier: unblock engineers, raise the bar on production readiness, and establish platform best practices
Own and evolve the core backend platform (Django/DRF/ASGI) performance and correctness
Scale async execution across Celery + Dramatiq + Temporal/Cortex; implement resilient workflow patterns (retries, circuit breakers, graceful degradation)
Optimize PostgreSQL/pgvector (query tuning, connection pooling) and caching strategies
Maintain and improve Kubernetes deployment infrastructure (GKE, Helm, Terraform/OpenTofu) and CI/CD + rollout strategies. Own KEDA autoscaling policies and resource allocation across worker pools.
Own reliability of RabbitMQ, Redis, and PostgreSQL infrastructure; lead incident response and post-mortems
Extend OpenTelemetry + Datadog instrumentation, dashboards, alerts, and SLOs; profile and reduce latency/memory bottlenecks

What We're Looking For

Required

10+ years building and operating production backend systems at scale
Deep expertise in Python (Django preferred) and relational databases (PostgreSQL)
Hands-on experience with Kubernetes, Helm, and cloud infrastructure (GCP preferred)
Strong background in distributed systems: message queues, event sourcing, workflow orchestration
Production experience with async task systems (Celery, Dramatiq, or similar)
Track record of debugging complex production issues across multiple services
Ability to work autonomously and drive technical initiatives without close supervision
Clear technical communication—able to explain tradeoffs and build consensus

Preferred

Experience with Temporal or similar workflow engines
Background in LLM infrastructure, RAG systems, or AI/ML platforms
Familiarity with OpenTelemetry, Datadog, or similar observability stacks
Experience with KEDA or other Kubernetes autoscaling solutions
Contributions to multi-tenant SaaS platform architecture
History of improving developer experience and platform abstractions

What Success Looks Like

Platform services maintain high availability with predictable performance under load
Scaling bottlenecks are identified and resolved proactively
New features ship faster because platform primitives are well-designed and documented
Incidents are rare, quickly detected, and thoroughly addressed
Engineers across the team adopt platform patterns and best practices
Technical debt is systematically identified and paid down
You're a trusted technical voice in architectural discussions

Compensation & Logistics

Compensation: Competitive salary commensurate with experience (Staff/Principal level)
Location: Remote
Type: Full-time
Requirements: Overlap with Americas timezones for collaboration; reliable high-speed internet

Staff Software Engineer Principal Engineer Python Django PostgreSQL pgvector Kubernetes Helm GCP Celery Dramatiq Temporal RabbitMQ Redis OpenTelemetry Datadog KEDA Distributed Systems Async Platform

Average salary estimate

$240000 / YEARLY (est.)

min

max

$180000K

$300000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs

Senior Software Architect, AI Systems and Networking

NVIDIA Hybrid US, CA, Santa Clara

VIEW

Posted 6 hours ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

Senior Architect role to design and implement high-performance AI communication and memory libraries while driving hardware-software co-optimization across GPUs, DPUs, NICs, and interconnects at NVIDIA.

eCommerce Technical Lead, Application Development and Maintenance

Cardinal Health Hybrid US-Nationwide-FIELD

VIEW

Posted 15 hours ago

Lead the architecture, development, and stabilization of Cardinal Health's cloud-native eCommerce platforms while guiding distributed engineering teams and driving modernization efforts.

Direct-to-Consumer Engineering, DevOps Internships – Academic Year

NBCUniversal Hybrid 30 Rockefeller Plaza, New York, NY 10111, USA

VIEW

Posted 2 hours ago

NBCUniversal's DTC Engineering DevOps Academic Year internship offers a paid, part-time remote opportunity to support TVE infrastructure, CI/CD, and automation across Peacock and other DTC platforms.

Site Reliability Engineer

Clarity Innovations Hybrid Remote

VIEW

Posted 11 hours ago

Experienced Site Reliability Engineer needed to lead observability, automation, and data-focused reliability efforts for cloud-based national security systems in a collaborative, mission-driven environment.

AI Native, Software Engineer

ConsumerAffairs Hybrid No location specified

VIEW

Posted 9 hours ago

ConsumerAffairs is hiring an AI-native Software Engineer to design, build, and maintain scalable backend systems and full-stack features across a Django/Python and React codebase while using AI tools as an integral part of the workflow.

Software Engineer 3

InterImage Hybrid No location specified

VIEW

Posted 9 hours ago

Senior-level software engineer to design, implement, and lead development of complex, performance-sensitive systems and algorithms for InterImage.

Mgr, Software Engineering

Renesas Electronics Hybrid Austin, TEXAS

VIEW

Posted 13 hours ago

Lead and mentor a hybrid software engineering team at Renesas to deliver embedded software solutions using Java/Kotlin and Python while driving execution, collaboration, and process improvements.

Senior Angular/Full-Stack Software Engineer, Wellfit Plans

Wellfit Technologies Hybrid Irving, TX

VIEW

Posted 20 hours ago

Senior Angular/Full-Stack Engineer to drive front-end architecture and build provider-facing treatment planning and eligibility UIs at Wellfit, working across Product, Design, and backend teams.

Director of Engineering – vSphere Kubernetes Service

Broadcom Hybrid USA-CA - Promontory B

VIEW

Posted 12 hours ago

Lead Broadcom’s VKS engineering organization and upstream CNCF strategy to deliver a world-class Kubernetes experience on vSphere for enterprise customers.

Senior Director, Head of Application Engineering and Productionization

Pfizer Hybrid United States - New York - New York City

VIEW

Posted 23 hours ago

Senior Director responsible for leading application engineering and productionization to deliver enterprise-grade AI/ML and digital applications at scale for Pfizer's AI Acceleration organization.

Deployed Engineer (Salt Lake City)

LangChain Hybrid No location specified

VIEW

Posted 4 hours ago

Work with customers to co-architect, build, and operate production AI agents using LangChain’s platform and tools.

Film Technology AR/VR Internships – Academic Year

NBCUniversal Hybrid 100 Universal City Plaza, Universal City, CA 91608, USA

VIEW

Posted 17 hours ago

Academic Year internship at NBCUniversal's Universal Pictures Content Group focused on full-stack and AR/VR development, machine learning experimentation, and digital transformation projects.

Product Engineering Intern

Vendelux Hybrid No location specified

VIEW

Posted 9 hours ago

Work with Vendelux's Product Engineering team to build user-facing full-stack features and gain hands-on startup engineering experience in a backend-focused, remote-friendly internship.

I Infinity

3 jobs

MATCH

Calculating your matching score...

FUNDING

Growth

DEPARTMENTS

Software Engineering

SENIORITY LEVEL REQUIREMENT

Director / Expert

TEAM SIZE

No info