- Contribute to our developer infrastructure, including simulation and HW emulation platforms, to enable performance measurement and optimization for Meta’s in-house accelerator programs.
- Understand and contribute to the collective communications library, intended to be deployed on Meta’s AI/ML superclusters.
- Support networking and compute hardware acceleration techniques to improve ML inference and training model performance.
- Perform architectural analysis to ensure system designs meet performance, scalability, and reliability requirements.
- Implement simulation models for Meta’s Accelerator ASICs, develop and analyze various scenarios to evaluate data center performance and identify potential improvements.
- Collaborate with architects and engineers to integrate simulation results into system design processes.
- Use instruction set simulators to define performant firmware for Meta's training/inference accelerators.
- Collaborate with hardware and firmware teams to ensure accurate modeling and simulation of accelerator functionalities.
- Analyze simulation results to guide firmware development and optimization efforts.
- Masters or doctorate degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
- 5+ years of programming experience in C++.
- 5+ years of programming experience in Python.
- Understanding of performance and benchmarking measurement and optimization on collective communications and distributed at-scale model training.
- Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
- Full-stack experience and understanding of AI/HPC systems, from hardware and infrastructure through the application layer, performance optimizations, including familiarity with relevant tools, libraries, and frameworks (e.g., NCCL, PyTorch, CUDA).
- Experience in one or more of the following machine learning/deep learning domains: hardware accelerators, AI Infrastructure, and/or high performance compute (HPC), particularly pertaining to interconnect and collective.
- Knowledge of AI/HPC hardware requirements and specifications (e.g., configuring hardware components, GPU, memory, network for AI/HPC workloads).
- Understanding of the transport stack (e.g., RoCE), its constraints and performance measures and how transport considerations enable the collective communications stack.
- Experience with SystemC.
Other Jobs from Meta
ASIC Engineer, Infra Silicon Enablement
Research Scientist Intern, Applied Vision (PhD)
Software Engineer (Leadership) - Product
Display Electrical Engineering Manager
Tooling Program Manager, Research Operations
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 401 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say