Sr. Development Engineer -- ML Training and Performance

THE ROLE:
We are looking for a Machine Learning Engineer to join our Models and Applications team. If you are excited by the challenge of distributed training of large models on a large number of GPUs, and if you are passionate about improving training efficiency while innovating and generating new ideas, then this role is for you. You will be part of a world-class team focused on addressing the challenge of training generative AI.

THE PERSON:
The ideal candidate should have experience with distributed training pipelines, be knowledgeable in distributed training algorithms (Data Parallel, Tensor Parallel, Pipeline Parallel, ZeRO), and be familiar with training large models.

KEY RESPONSIBILITIES:
- Train large models to convergence on AMD GPUs.
- Improve the end-to-end training pipeline performance.
- Optimize the distributed training pipeline and algorithm to scale out.
- Contribute your changes to open source.
- Stay up to date with the latest training algorithms.
- Influence the direction of the AMD AI platform.
- Collaborate across teams with various groups and stakeholders.

PREFERRED EXPERIENCE:
- Experience with ML frameworks such as PyTorch, JAX, or TensorFlow.
- Experience with distributed training and distributed training frameworks, such as DeepSpeed.
- Experience with LLMs or computer vision, especially large models, is a plus.
- Excellent Python programming skills, including debugging, profiling, and performance analysis.
- Experience with ML pipelines.
- Strong communication and problem-solving skills.

ACADEMIC CREDENTIALS:
A Master's degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field.

LOCATION:
San Jose, CA

#LI-MV1 #LI-HYBRID
At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan. You’ll also be eligible for competitive benefits described in more detail here.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services.

AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.
Tags: No, USD $120,750.00/Yr., USD $172,500.00/Yr., US Careers (External)