FriendliAI

Software Engineer, AI Agents

San Francisco, CA
Python API Machine Learning HuggingFace LangChain LlamaIndex Kubernetes
Description

Software Engineer – AI Agents

Department: Engineering

Location: San Francisco

Employment Type: FullTime

About the Job

We’re seeking an Agent Engineer to design and build agentic features in our platform, including document understanding, advanced RAG, and customer support automation. In this role, you will develop not only the agent components themselves, but also the Friendli Agent API, which serves as the core developer interface for building and extending agent applications. You will also build agent applications as production-ready examples of how agents can solve real-world problems.

These applications will be primarily written in Python and will serve as reference implementations for our customers and community. We are looking for a hands-on engineer who is passionate about building agent systems and making AI easy for developers to adopt. The ideal candidate is comfortable creating agent applications that showcase what is possible, is curious about and experienced with open-source models, and enjoys turning them into reliable, high-impact features.

Key Responsibilities

  • Design, build, and maintain agent APIs and applications that deliver document understanding and other high-value features

  • Evaluate and integrate open-source models to power production-ready agent features where possible

  • Develop reference agent applications to showcase workflows and accelerate customer adoption

  • Collaborate with backend and infrastructure teams to integrate agents with deployment, orchestration, and monitoring systems

  • Ensure APIs are robust, developer-friendly, and enterprise-ready through strong design principles and documentation

  • Continuously improve the reliability, scalability, and performance of agent features in production

Qualifications

  • 3+ years of experience in software engineering, preferably in backend, ML systems, or API development

  • Bachelor’s or Master's degree in Computer Science, Computer Engineering, or equivalent

  • Strong programming skills in Python; experience with various Python frameworks

  • Solid understanding of LLM workflows, agent patterns, or tool invocation systems

  • Experience designing and delivering production APIs

  • Familiarity with open-source LLMs and multimodal models (HuggingFace, LangChain, LlamaIndex, etc.)

  • Strong foundations in cloud-native development

Preferred Experience

  • Experience with document understanding pipelines (e.g., OCR, RAG, summarization, structured extraction)

  • Familiarity with Kubernetes or container orchestration in production

  • Built or contributed to agent frameworks, SDKs, or CLIs

  • Have worked in a startup or fast-paced environments with ownership and ambiguity

  • Passion for developer experience and enabling AI adoption

Benefits

  • Flexible working hours

  • Daily lunch and dinner provided; unlimited snacks and beverages

  • Supportive and highly collaborative work environment

  • Health check-up support and top-tier equipment/hardware support

  • A front-row seat to the generative AI infrastructure revolution

  • Competitive compensation, startup equity, health insurance, and other benefits.

About FriendliAI

FriendliAI is building the world’s best AI inference platform that makes large language and multi-modal models fast, efficient, and deployable at scale. We power high-throughput, low-latency AI workloads for organizations worldwide and integrate directly with Hugging Face, giving developers instant access to over 500,000 open-source models.

We are a small, fast-moving team doing work that matters at one of the most exciting moments in the history of technology. With our world-class inference engine, we are building a platform that the AI industry can actually rely on.

FriendliAI
FriendliAI

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say