Description

As we prepare to deploy our models across GPUs, CPUs, and NPUs, we're seeking an expert who can optimize an inference stack for each platform. We're looking for someone who can take our models, dive deep into the task, and return with a highly optimized inference stack that builds on existing frameworks such as ggml, vLLM, and DeepSpeed to deliver high throughput and low latency.

The ideal candidate is a highly skilled engineer with extensive experience in CUDA, C++, and Triton, as well as a deep understanding of GPU, CPU, and NPU architectures. They should be self-motivated, capable of working independently, and driven by a passion for optimizing performance across diverse hardware platforms. Proficiency in building and enhancing inference stacks with frameworks like ggml, vLLM, and DeepSpeed is essential. Experience with mobile development and expertise in cache-aware algorithms are highly valued.

Responsibilities

Strong ML Experience: Proficiency in Python and PyTorch to effectively interface with the ML team at a deeply technical level.

Hardware Awareness: Must understand modern hardware architecture, including cache hierarchies and memory access patterns, and their impact on performance.

Proficient in Coding: Expertise in Python, PyTorch, and at least one of CUDA, Triton, or C++ is essential for this role.

Optimization of Low-Level Primitives: Responsible for optimizing core primitives to ensure efficient model execution.

Self-Direction and Ownership: Ability to independently take a PyTorch model and a set of inference requirements (e.g., maximize GPU throughput or minimize CPU latency) and deliver a fully optimized stack with minimal guidance.

Research-Driven: Should stay up to date with advancements in ML inference, such as new quantization techniques or speculative decoding, while staying focused on delivering practical solutions.

Liquid AI

Artificial Intelligence (AI), Generative AI, Information Technology, Machine Learning

Member of Technical Staff - Machine Learning Engineer, Inference
