Browse 2 exciting jobs hiring in Low Latency Inference now. Check out companies hiring such as Sandbar, ASAPP in Newark, Madison, Portland.
Lead the design and deployment of low-latency, production ML systems for voice, audio, and agentic control at an early-stage hardware and software startup in New York City.
Lead the Core GenerativeAgent team to design, build, and deploy low-latency, enterprise-grade conversational voice AI combining LLMs with speech-to-text, text-to-speech, and real-time streaming pipelines.