Browse 21 exciting jobs hiring in Llm Optimization now. Check out companies hiring such as Awesome Motive, Zencore, Jobgether in Oceanside, El Paso, Chicago.
Hercules is hiring a senior/staff SEO Content Writer to lead keyword strategy, produce high-impact long-form content, and optimize it to drive traffic and paid conversions using AI-driven workflows.
Lead design and delivery of secure, scalable, production-grade AI/ML solutions as Zencore’s Principal Architect, advising clients and shaping cloud-native architectures.
Lead performance and scalability improvements for LLM inference by optimizing runtime components, multi-GPU execution, and open-source serving frameworks at scale.
Darkroom seeks an SEO Specialist to own SEO and GEO strategy across high-growth consumer brands, optimizing for both traditional search and LLM-driven discovery.
Character.AI is seeking a Product Marketing Manager to lead GTM strategy, own ASO across stores, and craft conversion-focused in-product copy for a high-growth consumer AI platform.
Experienced technical product leader needed to own prioritization, quality, and stakeholder alignment for LLM-driven products while staying hands-on with architecture, code reviews, and AI cost optimization.
Drive production-ready model optimization, custom kernel development, and edge deployment to enable real-time inference of large-scale models on vehicle SOCs for Zoox's Perception team.
Fortune Brands is hiring an SEO Specialist to lead SEO, GEO and AI-driven search optimizations for the Master Lock brand across eCommerce and owned websites.
Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.
Lead performance engineering for FSI-focused AI and HPC workloads at NVIDIA, optimizing parallel algorithms and GPU/CPU systems to unlock world-class performance.
Lead the design and optimization of LLM and RAG systems that power personalized, data-driven insights for athletes and coaches at Texas Sports Academy.
Lead the design and delivery of a closed-loop intelligence layer that enables an autonomous trading fleet to learn from real-time outcomes and improve profitability.
Lead development of ML-based combinatorial optimization and design-space-exploration tools to optimize LLM training and inference across GPU/CPU clusters and high-performance networking at datacenter scale.
Instrument is hiring a Senior AI Engineer to design and implement the core multi-agent intelligence, context management, and evals infrastructure for a large-scale, stateful generative-AI simulation project.
Lead the design and deployment of low-latency, production ML systems for voice, audio, and agentic control at an early-stage hardware and software startup in New York City.
Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.
Work across modeling, systems, and product to design, optimize, and ship production-grade AI systems for real-world users.
Lead the design and implementation of Slate's unified AI backend platform to make model integrations reliable, cost‑efficient, and production-ready at scale.
Concentrate is hiring a hands-on Forward Deployed AI Engineer to combine customer-facing problem solving with engineering work to improve multi-provider LLM routing, reliability, observability, and cost efficiency.
Decagon is hiring a Senior ML Infrastructure Engineer to design and scale distributed training and multi-provider inference platforms for LLMs and multimodal models.
Varick seeks an AI Engineer to architect and ship production-grade agent systems, evaluation pipelines, and retrieval-driven context strategies for enterprise AI deployments.