Browse 176 open jobs in Reliability. Companies hiring include Fieldguide, Zapier, and Clarity Innovations, with roles in New Orleans, Chandler, and Washington.
Lead the design and implementation of evaluation infrastructure and observability for enterprise-grade AI agents powering audit and assurance workflows at Fieldguide's San Francisco office.
Lead and scale Zapier’s infrastructure and platform engineering organization, driving an AI-first platform strategy that boosts developer velocity, reliability, and product delivery.
Experienced Site Reliability Engineer needed to lead observability, automation, and data-focused reliability efforts for cloud-based national security systems in a collaborative, mission-driven environment.
Keller Executive Search is recruiting a hands-on Mechanical/Electrical Engineer to support operations, maintenance, and capital projects across Mississippi energy facilities.
Lead maintenance operations and partner with engineering at Wabtec's L&M Radiator Yankton facility to improve equipment reliability, safety, and production efficiency.
An experienced SRE/DevOps professional is needed to architect automation, observability, and runbooks for OCC's critical clearing platform while mentoring teammates and improving reliability.
Experienced SRE with a strong infrastructure background wanted to help operate, automate, and scale MongoDB Atlas across multi-cloud environments.
Experienced backend technical leader needed to architect and drive resilient, low-latency cloud-native systems for State Street’s Wealth Custody & Clearing platform.
EAG Laboratories (Eurofins) presents a virtual "Cultivating Talent & Community" info session to showcase materials-science career paths, mentorship programs, and opportunities across R&D and reliability services.
NVIDIA is hiring a Senior Staff Software Engineer to design agentic AI automation and build integrations to transform enterprise IT operations and prevent problems at scale.
Lead the design and scaling of enterprise-grade, reliable cloud platforms as an SRE Architect working with cross-functional teams in a hybrid Austin, TX environment.
Lead the design and operation of Axle Health's secure, scalable AWS infrastructure and CI/CD pipelines to support enterprise-grade, HIPAA-compliant in-home healthcare software.
Freshworks is hiring a Senior Director of Engineering in San Mateo to lead and scale the ITOM engineering organization focused on AIOps, observability, and high-scale cloud-native platforms.
Senior technical leader to architect next-generation DRAM systems—driving cross-stack co-optimization, RAS/telemetry features, and customer-facing strategy at Micron's Boise main site.
The University of Chicago's CTDS is hiring a Senior Platform Engineer to lead production support, CI/CD pipelines, monitoring, and security automation across hybrid cloud and on‑prem translational data science platforms.
Lead engineering delivery and quality across multiple remote teams to build secure, scalable healthcare systems while shaping engineering practices and mentoring managers.
Trimble is seeking a Site Reliability Engineer to strengthen and scale Vista Cloud infrastructure for enterprise AECO customers by delivering automation, robust monitoring, and deep technical support.
Be the engineer who designs and operates large-scale Linux infrastructure, CI/CD pipelines, and automation to power Intel's architecture modeling and simulation workflows.
Ro is hiring a Senior Site Reliability Engineer to strengthen and scale our AWS-based infrastructure, improve uptime and MTTR, and help embed reliability practices across the engineering organization.
Customer Experience Manager needed to coordinate customer programs, manage timelines and deliverables, and ensure high-quality engagement for a fast-moving deep-tech startup in millimeter-wave RF.
Hudu is hiring an experienced DevOps Engineer to operate and optimize its Rails-based SaaS infrastructure on AWS and Kubernetes, focusing on reliability, security, and performance.
SpaceX is hiring a Site Reliability Engineer to build and operate mission-critical application infrastructure that accelerates and secures vehicle and satellite software delivery.
Senior technical leader sought to shape LinkedIn’s core infrastructure strategy and lead cross-team initiatives across networking, storage, and messaging at massive scale.
Lead product strategy and execution for Constructor's prospect-facing demo and sandbox platform, balancing a sales-first mindset with a platform reliability approach to accelerate revenue.
Visa is hiring a Software Development Engineer on the Product Reliability Engineering team to build scalable automation, database platform tooling, and GenAI-powered reliability solutions for global payment infrastructure.
Intel is hiring a Sr. Facilities Engineer to lead mechanical system ownership, reliability engineering, and cross-discipline coordination for critical data center and lab environments at its Mission Campus.
Lead Range Energy's reliability efforts by designing reliability programs, predictive models, and test plans that ensure high uptime and durability for Class 8 electric vehicle systems.
Applied Materials is hiring an Operations and Customer Quality Engineer II to lead inspection, testing, and qualification activities that improve manufacturing quality at the Kalispell, MT site.
Lead the architecture and delivery of Crusoe's cloud and infrastructure management systems to enable highly available, secure, and scalable AI infrastructure.
Nabla seeks a senior SRE/Backend engineer to drive platform reliability and scalability for its clinical AI systems supporting clinicians across the US and EU.
Lead Rhoda AI's hardware organization to design, validate, and scale world-class humanoid robotic systems from concept through volume production.
HMA is hiring a Service Reliability Engineer II to provide operational ownership, reliability engineering, and performance optimization for a large distributed claims platform during a multi‑year modernization.
Lead the design, integration, and validation of robotics and automation systems at Jabil’s St. Petersburg/Tampa manufacturing site, driving reliability, safety, and cross-functional delivery excellence.
Experienced failure analysis engineer needed to lead root-cause investigations and reliability improvements for avionics and high-reliability electronics at Shield AI's Dallas facility.
Engineering internship at Vistra's Davis-Besse plant offering rotational, hands-on experience across multiple plant engineering disciplines within a safety-focused nuclear operations environment.
Lead PlayStation's Service Reliability Engineering team to own global uptime, stability, and operational excellence for FTG's cloud gaming infrastructure.
Hammerhead is hiring a Site Reliability Engineer to establish and run the reliability function for an AI-driven power orchestration platform deployed across cloud and on-prem data centers.
Lead a distributed SRE team at LexisNexis Risk Solutions to design and operate secure, automated, cloud-native infrastructure and drive on‑prem-to‑cloud migrations using Terraform, Azure, and modern CI/CD patterns.
Work on scalable automation and flight/ground software to operate Loft’s growing heterogeneous satellite fleet while serving as a rotating Flight Director.
Lead the Selection engineering team at Spotify to transform the Messaging Platform into a scalable, real-time, ML-informed decision engine that influences messaging for hundreds of millions of users.
Homebot seeks a Senior DevOps Engineer to lead multi-cloud (AWS and GCP) infrastructure design, operation, and developer enablement for our platform.
Heron Power seeks a Lead Dielectric Materials Engineer to define, qualify, and scale robust insulation systems for medium-voltage power conversion hardware.
Agilent Technologies seeks a Reliability Engineer to enhance equipment availability and drive preventive and predictive maintenance programs at its Frederick API manufacturing site while ensuring cGMP compliance.
Design and operate high-throughput backend systems at Mercor to power candidate-job matching, routing, and marketplace workflows.
Stitch Fix is hiring a Platform Engineer to enhance cloud-native infrastructure, developer tooling, and CI/CD workflows to improve developer experience across the company.
Lead PECVD equipment reliability and scaling for Starlink's high-efficiency solar cell production at SpaceX's Bastrop facility.
Lead the architecture and operation of production-scale GPU clusters at Andromeda, partnering with customers to maximize distributed training reliability and performance.
CaptivateIQ is seeking a Staff Software Engineer to lead the technical direction and scaling of its Modeling Platform, turning its computation engine into a distributed, enterprise-grade service.
Anduril's Discovery team is hiring a Site Reliability Engineer to design and operate scalable, secure deployments that integrate cloud, robotics, and mesh networking for mission-critical systems.
Mach Industries is hiring a Lead EHS & Facilities Engineer to build and run a world-class safety, environmental, and facilities program that ensures site readiness, uptime, and regulatory compliance as the company scales production.
Below 50k*: 1 | 50k-100k*: 3 | Over 100k*: 68