We’re seeking an Agent Engineer to design and build agentic features in our platform, including document understanding, advanced RAG, and customer support automation. In this role, you will develop not only the agent components themselves, but also the Friendli Agent API, which serves as the core developer interface for building and extending agent applications. You will also build agent applications as production-ready examples of how agents can solve real-world problems.
These applications will be primarily written in Python and will serve as reference implementations for our customers and community. We are looking for a hands-on engineer who is passionate about building agent systems and making AI easy for developers to adopt. The ideal candidate is comfortable creating agent applications that showcase what is possible, is curious about and experienced with open-source models, and enjoys turning them into reliable, high-impact features.
Design, build, and maintain agent APIs and applications that deliver document understanding and other high-value features
Evaluate and integrate open-source models to power production-ready agent features where possible
Develop reference agent applications to showcase workflows and accelerate customer adoption
Collaborate with backend and infrastructure teams to integrate agents with deployment, orchestration, and monitoring systems
Ensure APIs are robust, developer-friendly, and enterprise-ready through strong design principles and documentation
Continuously improve the reliability, scalability, and performance of agent features in production
3+ years of experience in software engineering, preferably in backend, ML systems, or API development
Bachelor’s or Master's degree in Computer Science, Computer Engineering, or equivalent
Strong programming skills in Python; experience with various Python frameworks
Solid understanding of LLM workflows, agent patterns, or tool invocation systems
Experience designing and delivering production APIs
Familiarity with open-source LLMs and multimodal models (HuggingFace, LangChain, LlamaIndex, etc.)
Strong foundations in cloud-native development
Experience with document understanding pipelines (e.g., OCR, RAG, summarization, structured extraction)
Familiarity with Kubernetes or container orchestration in production
Built or contributed to agent frameworks, SDKs, or CLIs
Have worked in a startup or fast-paced environments with ownership and ambiguity
Passion for developer experience and enabling AI adoption
Flexible working hours
Daily lunch and dinner provided; unlimited snacks and beverages
Supportive and highly collaborative work environment
Health check-up support and top-tier equipment/hardware support
A front-row seat to the generative AI infrastructure revolution
Competitive compensation, startup equity, health insurance, and other benefits.
FriendliAI is building the world’s best AI inference platform that makes large language and multi-modal models fast, efficient, and deployable at scale. We power high-throughput, low-latency AI workloads for organizations worldwide and integrate directly with Hugging Face, giving developers instant access to over 500,000 open-source models.
We are a small, fast-moving team doing work that matters at one of the most exciting moments in the history of technology. With our world-class inference engine, we are building a platform that the AI industry can actually rely on.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Superhuman seeks a Full-Stack Software Engineer to deliver scalable back-end services and rich front-end experiences as part of a hybrid engineering team empowering millions of users.
Lead application and cloud security for a fast-growing AI EdTech platform, embedding with engineering teams to build secure-by-default systems and developer-friendly security workflows.
Alegeus is hiring a Software Engineer II to design, develop, and maintain .NET-based SaaS applications that support fintech and healthtech solutions in a collaborative, hybrid environment.
NextGen Federal Systems seeks a seasoned Senior Software Engineer to lead full-stack TypeScript/React/Node development and deliver secure, mission-critical software in an agile, DevSecOps-aware environment.
Experienced Site Reliability Engineer needed to lead observability, automation, and data-focused reliability efforts for cloud-based national security systems in a collaborative, mission-driven environment.
Lead and mentor a software engineering team at Renesas to deliver high-quality embedded and application software while driving execution and cross-functional collaboration.
Senior Architect role to design and implement high-performance AI communication and memory libraries while driving hardware-software co-optimization across GPUs, DPUs, NICs, and interconnects at NVIDIA.
Senior Technical Architect needed to lead architecture, prototyping, and technical decisions for R&D product work on a Tiered Pricing Mechanism in a remote Web3/DeFi research unit.
Work with customers to co-architect, build, and operate production AI agents using LangChain’s platform and tools.
Experienced C++ engineers are needed to evaluate, repair, and improve AI-generated code as contractor contributors to an RLHF pipeline.
Bosch Rexroth is hiring a Summer 2026 Software Engineering Intern to develop C# tools that generate and optimize C++ code for embedded systems in mobile machine applications.
Contribute to Isaac Lab as a Software Engineering Intern focused on building scalable simulation, perception-in-the-loop RL, and sim-to-real capabilities for robot learning at NVIDIA.
An opportunity for a motivated student to join a development team as a Software Engineer Intern and work on Angular front-ends and C# backend services while leveraging AI development tools.