
Forward Deployed Site Reliability Engineer

About the Company

At Twenty, we're taking on one of the most critical challenges of our time: defending democracies in the digital age. We develop revolutionary technologies that operate at the intersection of the cyber and electromagnetic domains, where the speed of operations exceeds human sensing and complexity transcends conventional boundaries. Our team doesn't just solve problems – we deliver game-changing outcomes that directly impact national security. We're pragmatic optimists who understand that while our mission of protecting America and its allies is challenging, success is possible.

Role Summary

You'll be our eyes, ears, and hands on the ground at a government customer site, ensuring the reliability and performance of Twenty's mission-critical platform running in a restricted, air-gapped AWS environment. This role sits at the intersection of deep technical ownership and customer-facing engineering: you'll define how we measure reliability, lead incident response in a constrained environment, and serve as the primary technical link between what's happening on-site and the engineering team back in Arlington. You'll work closely with the DevSecOps engineer to ensure the platform operates within government security and compliance requirements, and with product engineers to translate operational reality into actionable feedback. You'll report directly to the VP of Engineering. If you thrive operating with autonomy in high-stakes environments and find satisfaction in making complex systems provably reliable, this role is for you.

Who You Are

  • You own reliability outcomes, not just uptime dashboards — you define what "healthy" means and hold the system to it.

  • You're as comfortable writing a runbook as you are deep in a production incident with limited tooling and no safety net.

  • You operate well with minimal remote support — ambiguity doesn't paralyze you, and you know when to escalate versus when to solve it yourself.

  • You build trust naturally with external stakeholders, including government customers, and can translate complex technical situations into plain language under pressure.

  • You treat toil as a bug: if you're doing something manually more than twice, you automate it.

  • You communicate with precision — your incident reports and runbooks are read by people who weren't in the room, and they need to be right.

  • You understand that in a restricted environment, you are the feedback loop — and you take that responsibility seriously.

What You'll Do

Reliability Engineering

  • Define, track, and report on SLIs and SLOs for platform services running in the customer environment.

  • Use error budgets to drive reliability conversations with the Arlington engineering team, translating operational data into prioritized engineering work.

  • Identify and eliminate toil: build automation for repetitive operational tasks within the constraints of the secure environment.

  • Conduct post-incident reviews, own root cause analysis, and drive durable fixes in partnership with the engineering team.

Observability & Incident Response

  • Own the observability posture for the on-site deployment — dashboards, alerting thresholds, and log pipelines using the LGTM stack (Grafana, Loki, Tempo, Mimir).

  • Lead incident response on-site: triage, containment, coordination with Arlington, and customer communication.

  • Maintain and continuously improve runbooks for operational procedures and emergency response protocols.

  • Serve as the on-call anchor for the customer environment, with clear escalation paths to the engineering team.

Deployment & Infrastructure Operations

  • Work with the customer deployment team to get Twenty's platform stood up and updated within the restricted environment.

  • Manage containerized services (Docker, Docker Compose) across the deployment lifecycle: configuration, updates, and rollbacks.

  • Apply and validate Terraform-based infrastructure changes within the enclave, in coordination with the DevSecOps (DSO) engineer, who owns IaC policy and guardrails.

  • Perform capacity planning and flag scaling requirements to the Arlington team before they become incidents.

Customer Liaison & Engineering Feedback

  • Serve as the primary technical interface between the government customer and Twenty's engineering team — translating operational requirements, constraints, and issues in both directions.

  • Represent the operational environment accurately in engineering discussions: what the team in Arlington can't see, you make visible.

  • Partner with the DevSecOps engineer on compliance, logging, and audit requirements specific to the customer environment.

  • Provide technical guidance and support to customer stakeholders on system behavior and troubleshooting procedures.

Must Have

  • 5+ years of professional experience in site reliability engineering, production operations, or a closely related infrastructure role.

  • Proven experience defining and tracking SLIs, SLOs, and error budgets in a production environment.

  • Hands-on experience with Docker, Docker Compose, and AWS (EC2, ECS, RDS, VPCs, security groups) in production deployments.

  • Solid Linux/Unix systems administration skills; productive in constrained environments where GUI tooling may be limited or unavailable.

  • Experience with Terraform for infrastructure provisioning and configuration, working within DSO-provided policy guardrails.

  • Experience with the LGTM observability stack or equivalent (Grafana, Loki, Prometheus/Mimir, distributed tracing).

  • Strong incident response experience: you've led responses, written post-mortems and runbooks, and shipped the preventive fix.

  • Scripting proficiency in Python or Bash for operational automation (familiarity with Go is a plus); experience with PagerDuty or equivalent on-call tooling.

  • Experience working in or directly supporting government or defense environments, including air-gapped or enclave deployments.

Nice To Have

  • Experience with NATS or similar pub/sub messaging systems in production.

  • Background in cyber operations, intelligence systems, or signals environments.

  • AWS certifications (Solutions Architect, SysOps, or DevOps Engineer).

Security Requirements

  • Must possess and be able to maintain a TS/SCI security clearance with appropriate polygraph

  • U.S. citizenship required

  • Willingness to travel occasionally for customer engagements and operational support

If this role sounds like you, apply and tell us why you're interested.

Some positions may require eligibility to obtain a U.S. Government security clearance. Any clearance requirement will be listed in the role description.

Twenty is an equal opportunity employer. We consider all qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, veteran status, disability, or any other protected status.

If you need a reasonable accommodation during the hiring process, let us know and we will work with you.

Average salary estimate

$180,000 per year (est.); range $140,000 (min) to $220,000 (max)

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Employment type: Full-time, onsite
Date posted: April 22, 2026