Hardware Design Engineer, AI Inference Engine
Department: Engineering
Location: Seattle
Employment Type: FullTime
About Elastix AI
We are building the next-gen AI inference platform.
Description
Location: Seattle, WA (Hybrid - 3 days/week in office)
About ElastixAI:
ElastixAI is an early-stage startup poised to revolutionize AI inference infrastructure. We are developing a cutting-edge AI inference solution that dramatically improves efficiency through a holistic co-design approach, spanning from machine learning optimizations and a highly specialized software stack to the inference engine and underlying cloud hardware. We believe in providing a customizable and optimal inference experience, much like tailoring a high-performance computing system to specific needs.
Role Summary:
We are seeking a visionary and hands-on Hardware Design Engineer to contribute to the design, definition, and implementation of our core AI inference engine. This is a deeply technical role where you will be instrumental in translating AI into a highly efficient hardware design. You will be at the center of our co-design philosophy, working to ensure our inference engine is perfectly harmonized with our ML strategies, software stack, and cloud hardware targets to deliver unparalleled performance and efficiency for next-generation AI models.
Key Responsibilities:
Contribute to the architectural definition, design, and implementation of a novel AI inference engine optimized for our specific ML workloads.
Collaborate closely with ML engineers to understand and influence ML directions
Work hand-in-hand with software engineers to define a seamless hardware-software interface, ensuring the inference engine is highly programmable, efficient, and easy to integrate into our broader software stack and compiler.
Partner with cloud engineers to ensure the inference engine architecture aligns with target cloud hardware capabilities, deployment strategies, and performance/cost objectives.
Model and analyze the performance, power, and area (PPA) trade-offs of different architectural choices.
Stay at the forefront of AI accelerator research, identifying emerging techniques and technologies relevant to our co-design approach.
Contribute to the RTL design, simulation, and verification efforts for the inference engine components.
Drive the hardware roadmap for the inference engine, anticipating future AI model trends and optimization opportunities.
Foster a culture of innovation and technical excellence within a highly interdisciplinary engineering team.
Required Qualifications:
BS, MS or PhD in Computer Engineering, Electrical Engineering, or a related field.
Proven experience (5+ years) in hardware design, with a strong focus on designing/implementing hardware for AI/ML acceleration.
Deep understanding of modern AI/ML models, particularly LLMs, and their computational characteristics.
Experience with hardware implementation of ML optimization techniques (e.g., sparsity, quantization, pruning).
Proficiency in Verilog or SystemVerilog for RTL design and simulation.
Strong understanding of memory system architecture, on-chip interconnects, parallel processing, and distributed computing.
Excellent problem-solving skills and the ability to analyze complex systems.
Exceptional communication and interpersonal skills, with a demonstrated ability to work effectively in a highly interdisciplinary environment, collaborating with ML, software, and cloud/systems engineers.
Ability to thrive in a fast-paced, dynamic startup environment with a strong bias for action/execution
Preferred/Bonus Qualifications:
Knowledge of compiler technologies for AI models (e.g., MLIR, TVM).
Familiarity with performance modeling and analysis tools.
Experience with system-level integration and debugging.
Contributions to relevant research publications or open-source projects.
Understanding of cloud computing environments and deploying hardware accelerators in the cloud.
High-speed inter-chip networking experience
What We Offer:
A chance to be a foundational engineer in an innovative AI startup.
A dynamic and collaborative work environment and the change to have a significant impact on new technology
The opportunity to work on challenging problems at the intersection of ML, software, and systems.
Competitive compensation and startup equity package
Comprehensive medical, dental, and vision coverage (100% paid by employer)
Flexible Time Off (FTO)
Paid parental leave
Company sponsored 401K Plan
Gym or fitness benefit
Commuter benefit
Investment in employee learning & development
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
