Browse 96 Prometheus jobs hiring now. Companies hiring include Clarity Innovations, theocc, and Broadcom, with roles in Fort Worth, Detroit, and Phoenix.
Experienced Site Reliability Engineer needed to lead observability, automation, and data-focused reliability efforts for cloud-based national security systems in a collaborative, mission-driven environment.
An experienced SRE/DevOps professional is needed to architect automation, observability, and runbooks for OCC's critical clearing platform while mentoring teammates and improving reliability.
Experienced software engineer needed to build scalable, high-throughput security analytics and micro-segmentation features using Kubernetes, Go/Java/Python, and big-data tooling at Broadcom.
Lead the design and scaling of enterprise-grade, reliable cloud platforms as an SRE Architect working with cross-functional teams in a hybrid Austin, TX environment.
Senior DevOps Engineer to lead CI/CD, automation, and large-scale test pipelines for Shield AI’s autonomy and aircraft software in Dallas.
Experienced Staff Software Engineer (Java/Spring) needed to lead integration and platform architecture, modernize APIs and cloud infrastructure, and enable secure AI/LLM integrations for a fast‑growing TPRM platform.
Mistral AI is hiring a Backend Engineer in New York to build scalable, high-performance backend services and APIs for its enterprise AI platform and consumer-facing products.
Work on core cloud platform infrastructure and analytics at UiPath, building backend systems, Kubernetes-based primitives, and production-grade observability to power AI-driven automation at scale.
Winsupply seeks an Intermediate Full-Stack Java Developer at its Moraine support campus to design, build, and maintain scalable RESTful services and integrations from design through production.
Lead the design, automation, and security of Novo's cloud infrastructure and developer platform as a senior individual contributor driving reliability and velocity.
Visa is hiring a Software Development Engineer on the Product Reliability Engineering team to build scalable automation, database platform tooling, and GenAI-powered reliability solutions for global payment infrastructure.
Constructor seeks a Senior Backend Engineer to design and operate low-latency, high-throughput Attribute Enrichment and Badges services that deliver ML-generated item attributes to global e-commerce customers.
Ivo is hiring a Senior Infrastructure Engineer to build and scale secure, multi-tenant cloud infrastructure and create LLM-driven tooling to boost reliability and developer velocity.
Help grow LiteLLM's developer community and drive adoption by creating technical content, engaging with customers, and representing the product at events and in pre- and post-sales technical conversations.
Nabla seeks a senior SRE/Backend engineer to drive platform reliability and scalability for its clinical AI systems supporting clinicians across the US and EU.
Shield AI is hiring a Staff DevOps Build Engineer to own and scale the C++ build and CI infrastructure that powers its autonomous aircraft platforms.
Provide L3 technical support and deep-dive troubleshooting for Redis Enterprise customers, handling escalations and performance issues across cloud and on-prem environments on a schedule that includes weekends.
Experienced Linux platform engineer needed to develop Python-based integrations, containerized deployments, cloud automation, and AI-enhanced infrastructure monitoring for multi-site enterprise environments.
Hammerhead is hiring a Site Reliability Engineer to establish and run the reliability function for an AI-driven power orchestration platform deployed across cloud and on-prem data centers.
Lead the design and implementation of a scalable, governance-first AI Agent Platform and SDKs to productionize agentic workflows across GEICO.
Unstructured is hiring a Senior Technical Support Engineer to debug and resolve production issues across customer VPC deployments, Kubernetes clusters, and data pipelines while partnering closely with Engineering and customer teams.
Lead the architecture and operation of production-scale GPU clusters at Andromeda, partnering with customers to maximize distributed training reliability and performance.
K1X is hiring a hands-on Machine Learning Operations Engineer to design and operate scalable ML infrastructure, pipelines, and production inference systems for a fully remote, Midwest-preferred startup.
Senior Full Stack Engineer to build end-to-end features for a remote-first, venture-backed fintech platform powering programmable stablecoins.
Senior Systems Engineer required to lead end-to-end infrastructure, cloud, and security engineering for cleared government programs and enterprise clients.
Samsung Austin Semiconductor is hiring an Engineering System Reliability (ESR) Engineer to maintain high-availability monitoring, lead incident response, and build CI/CD automation for critical MES and engineering systems in a 24x7 manufacturing environment.
Lead the Integrations & API engineering team to design and scale a first-class API and integrations platform that powers enterprise workflows across the NodeZero product.
Finch is hiring a Staff Software Engineer, Platform to lead critical infrastructure and developer-experience initiatives that enable reliable, high-velocity engineering across the company.
Lead Backend Software Engineer at Bumble Inc. to design and deliver scalable AWS-native backend systems, drive architecture decisions, and mentor engineering teammates.
Kochava is hiring a Senior Site Reliability Engineer to develop and operate scalable, highly available infrastructure and tooling across cloud and on-prem environments.
Anduril's Discovery team is hiring a DevOps Software Engineer to design and operate CI/CD, IaC, containerized deployments, and MLOps pipelines for high-impact autonomy and networking systems.
Lead Percona’s Observability Practice to design and deliver open-source-first monitoring, telemetry, and observability solutions across database environments while shaping product, partnerships, and go-to-market efforts.
Experienced SRE/DBA skilled in SQL Server, system administration, and cloud operations needed to ensure high availability and performance of Intelerad's medical imaging platforms.
Build end-to-end software and infrastructure for large-scale GPU AI clusters as a New Grad Software Engineer at Zettabyte, working across frontend, backend, and Kubernetes operations.
Lead the architecture and evolution of Crusoe’s large-scale observability platform to provide reliable metrics, logs, and traces for multi-region AI infrastructure.
Mozilla is hiring a Senior Software Engineer to lead backend development for Firefox Monitor, building scalable Node.js/TypeScript cloud-native systems that protect user privacy and reliability.
Pangram Labs is hiring a Senior Backend Software Engineer to design, build, and scale the production systems that serve its AI-detection platform in Brooklyn.
Help build and operate the core observability platforms that ingest, store, and surface telemetry for Crusoe's large-scale cloud and data-center infrastructure.
Experienced SRE needed to lead multi-cloud reliability, observability, and automation at a fast-growing defense-focused infrastructure company.
Lead Observability Engineer to build and operate Vantage’s Elastic Stack telemetry platform, define metrics and alerting, and enable operational visibility across data center and hybrid environments.
Lead the design and operation of a BYOC, Kubernetes-based platform to deploy and scale Archie across customer environments at P-1 AI.
Deepgram is hiring an ML Ops Infrastructure Engineer to design and operate scalable model deployment, CI/CD, and monitoring systems that deliver production-grade voice AI at scale.
Medtronic is hiring a Principal Software Cloud Engineer to architect and implement cloud-native microservices for CRM Software at its Minneapolis site.
Workday Government is hiring an SRE-focused software engineer to operate, troubleshoot, and harden large-scale cloud services for U.S. federal customers, requiring U.S. citizenship and clearance eligibility.
Intel's STTD team seeks a hybrid Software & Infrastructure Engineer to develop and operate cloud and on-prem distributed systems that enable semiconductor test and manufacturing.
Calix is hiring a Staff Cloud Platform Engineer (Kafka) to architect and operate large-scale Kafka streaming platforms and automation on GCP/AWS to support mission-critical real-time data pipelines.
Work with research teams to productionize large-scale generative models, build GPU inference infrastructure, and ensure reliable deployment and observability for production ML workloads.
Live Nation is hiring a Software Developer to build and operate scalable, secure streaming and cloud-native systems while improving automation, monitoring, and DevOps delivery.
Visa is seeking a Senior Network Engineer to modernize and automate its network monitoring and fault-management platforms by integrating vendor tools, cloud services, IaC, and GenAI-driven analytics.
Lead and grow a small platform engineering team to own Runway's TypeScript-based API platform, data pipelines, and revenue-generating public API domain.
Below 50k*: 0 | 50k-100k*: 5 | Over 100k*: 91