Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Reliability Engineer image - Rise Careers
Job details

Reliability Engineer

This role supports the U.S. Air Force Cloud One Architecture and Common Shared Services contract and currently has an opening for a Reliability Engineer. The Reliability Engineer is responsible for ensuring the availability, performance, scalability, and resiliency of mission‑critical systems. This role applies software engineering principles to infrastructure and operations, with a strong emphasis on automation, monitoring, incident response, and continuous reliability improvement. The reliability engineer serves as the bridge between development, operations, and platform teams to ensure production systems consistently meet defined service level objectives (SLOs) while supporting rapid, safe delivery of new capabilities.

 

 

Location: This position will be hybrid remote. Candidates will be required to work onsite as needed. Candidates preferred to be located near Hanscom AFB (Boston, MA).

System Reliability & Availability

  • Design, implement, and maintain highly available, fault-tolerant systems in cloud and hybrid environments
  • Define, measure, and report Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets
  • Identify reliability risks and implement mitigation strategies across the system lifecycle
  • Conduct capacity planning and performance modeling to ensure systems scale to meet demand

Monitoring, Observability & Alerting

  • Implement and manage monitoring, logging, and tracing solutions to provide full system observability
  • Define actionable alerting thresholds that minimize noise and enable rapid incident detection
  • Analyze trends and metrics to proactively identify potential reliability issues

Incident Response & Problem Management

  • Participate in on‑call rotations and lead incident response activities for production systems
  • Coordinate troubleshooting efforts across development, infrastructure, and security teams
  • Conduct post‑incident reviews (PIRs) and develop corrective and preventive action plans
  • Track recurring issues and ensure root causes are resolved

Automation & Engineering Excellence

  • Automate operational tasks to reduce manual intervention and operational risk
  • Develop scripts, tools, and services that improve system reliability and reduce mean time to recovery (MTTR)
  • Promote “automation over toil” and standardize operational workflows

Reliability‑Focused Engineering

  • Participate in architecture and design reviews with an emphasis on reliability, resiliency, and recoverability
  • Validate disaster recovery (DR) and business continuity plans; test failover mechanisms
  • Support chaos engineering, fault injection testing, and resilience validation where appropriate

Collaboration & Governance

  • Partner with DevOps, Platform, and Security teams to ensure reliability aligns with delivery and compliance objectives
  • Document system reliability standards, runbooks, and operational procedures
  • Support compliance and audit activities (e.g., FedRAMP, FISMA, internal operational controls)

 

Required Skills:

·       Bachelors and eight (8) years or more of experience; Masters and six (6) years or more of experience. Additional experience may be accepted in lieu of degree.  

·       Active Secret clearance at a minimum required to start  

·       US citizenship required 

·       Experience with cloud platforms (AWS, Azure, OCI, or GCP), including managed services

·       Experience with containerized environments (Docker, Kubernetes)

·       Familiarity with CI/CD pipelines and deployment automation

·       SLOs and error budgets

·       Capacity modeling and performance testing

·       Strong understanding of:

·       Distributed systems and high‑availability architectures

·       Linux/Windows system administration

·       Networking fundamentals (DNS, TCP/IP, load balancing)

·       Hands-on experience with:

·       Monitoring and observability tools (e.g., Prometheus, Grafana, ELK/Elastic, Datadog, Azure Monitor)

·       Infrastructure as Code (Terraform, ARM, CloudFormation)

·       Scripting or programming languages (Python, Bash, Go, PowerShell, or similar)

·       Experience supporting incident management and on‑call operations

 

Preferred Skills

  • Experience with USAF Cloud One or Platform 1. 
  • Experience with Zero Trust Architecture 
  • Cloud certifications in AWS, Azure, Google, or Oracle clouds 

SES provides a competitive salary and the following benefits:

  • Medical
  • Dental
  • Vision
  • AD&D
  • STD
  • LTD
  • Company paid Life Insurance
  • 401k with employer contribution
  • Paid Time Off
  • Pet Insurance

Average salary estimate

$145000 / YEARLY (est.)
min
max
$120000K
$170000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Systems Engineering Solutions Corporation logo

What it's like to work at Systems Engineering Solutions Corporation

Read Reviews
Similar Jobs
Posted 19 hours ago

Help design and implement the UI and interaction layer between engineers and Archie, shaping workflows and real-time systems that make AI a practical engineering teammate.

Posted 19 hours ago

Help scale Chime's AI-powered Jade assistant by building platform tooling, backend services, and observability systems as a Senior Full-Stack Engineer.

Photo of the Rise User
Trimble Hybrid US - Remote, MN
Posted 5 hours ago

Software Engineer to develop and improve high-availability web services and apps for Trimble Maps, with an emphasis on strong coding, problem solving, and iterative delivery.

Photo of the Rise User

Work remotely as a Front-End Application Developer building accessible, scalable React/Angular applications for environmental data platforms while contributing across the full stack.

Photo of the Rise User
Fundrise Hybrid No location specified
Posted 11 hours ago

Work on high-impact screening and fraud-prevention systems at Fundrise, building reliable, scalable software that protects millions of users while partnering closely with Legal, Finance, and Operations.

Photo of the Rise User
Vendelux Hybrid No location specified
Posted 8 hours ago

Work with Vendelux's Product Engineering team to build user-facing full-stack features and gain hands-on startup engineering experience in a backend-focused, remote-friendly internship.

Photo of the Rise User

Lead architecture and engineering efforts to design, build, and deliver scalable, containerized applications using Golang, JavaScript, and Python for mission-driven federal clients.

Photo of the Rise User
Posted 10 hours ago

Develop and maintain Angular front-end features for enterprise public-sector systems that enhance investigative workflows and data-driven decision-making.

Be the Forward Deployed Engineer who owns customer integrations, shapes product direction, and ensures stablecoin payments integrate reliably with partners at an early-stage NYC startup.

A paid summer Software Engineering Internship at Gen (NortonLifeLock) offering hands-on experience building and maintaining production code within a leading consumer cybersecurity organization.

NextGen Federal Systems seeks a seasoned Senior Software Engineer to lead full-stack TypeScript/React/Node development and deliver secure, mission-critical software in an agile, DevSecOps-aware environment.

Senior Angular/Full-Stack Engineer to drive front-end architecture and build provider-facing treatment planning and eligibility UIs at Wellfit, working across Product, Design, and backend teams.

Photo of the Rise User

Lead the architecture, development, and stabilization of Cardinal Health's cloud-native eCommerce platforms while guiding distributed engineering teams and driving modernization efforts.

SES is an industry leader in verification services with projects ranging from conformance with self-imposed sustainability standards to the functioning of national voluntary programs. Since 1998, SES has supported governmental and private clients ...

23 jobs
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
April 1, 2026
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!