Several years of non-internship professional software development experience
Several years of programming experience with at least one programming language
Several years of experience leading the design or architecture (design patterns, reliability, and scaling) of new and existing systems
Knowledge of Machine Learning algorithms and techniques
Experience as a thesis supervisor, mentor, tech lead or leading an engineering team
PhD in computer science or equivalent
Experience with Python, PyTorch, and C++ programming and performance optimization
Knowledge of LLMs and/or diffusion models
Experience with LLM and/or diffusion model inference, especially TensorRT-LLM and Trainium development
Excellent problem-solving skills, with the ability to think creatively and critically about complex problems
Strong communication and collaboration skills, with the ability to work effectively with cross-functional teams
Publications in the domain of Machine Learning, especially on LLMs and diffusion models
Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice (https://www.amazon.jobs/en/privacy_page) to know more about how we collect, use and transfer the personal data of our candidates.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
The Artificial General Intelligence (AGI) Inference Engine team is looking for a highly skilled Senior ML Engineer to lead the research and implementation of novel techniques that push the boundaries of efficient inference for Generative Artificial Intelligence (GenAI) models.
Key job responsibilities
Design, develop, test and deploy inference solutions for high-end LLMs and diffusion models
Explore emerging inference optimization techniques for LLMs and diffusion models
Design and conduct experiments, analyze results, and influence the Amazon AGI roadmap by providing recommendations
Optimize performance of LLMs and diffusion models on best-in-class GPU and AWS Neuron hardware
Collaborate with cross-functional teams of engineers and scientists to identify and solve complex problems in GenAI
Mentor and guide junior engineers, and contribute to the overall growth and development of the team
A day in the life
Senior engineers on our team are independent and have a great deal of freedom in how they contribute. They read recent papers, explore ideas, brainstorm, and freely connect with others to cooperate. They detect, investigate, and resolve complex problems, sometimes by creating software solutions and sometimes by finding and reusing something that already exists. They partner with leaders to focus on the right things and make good decisions. They make sure the team follows best practices in design and programming, and they lead operational efforts.
About the team
Our mission is to build best-in-class, fast, accurate, and cost-efficient large language and diffusion model inference solutions and infrastructure that will enable Amazon businesses to deliver more value to their customers.